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NOVEL BONE MARROW NUCLEIC ACIDS AND POLYPEPTIDES 



1. BACKGROUND OF THE INVENTION 
1.1 TECHNICAL FIELD 

The present invention provides novel bone marrow-expressed polynucleotides and bone 
marrow-expressed proteins encoded by such polynucleotides, along with uses for these 
polynucleotides and proteins, for example in therapeutic, diagnostic . and research methods. 

12 BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, such as 
lymphokines, interferons, CSFs, chemokines, and interleukins) has matured rapidly over the past 
decade. The now routine hybridization cloning and expression cloning techniques clone novel 
polynucleotides "directly" in the sense that they rely on information directly related to the 
discovered protein (i.e., partial DNA/amino acid sequence of the protein in the case of 
hybridization cloning; activity of the protein in the case of expression cloning). More recent 
"indirect" cloning techniques such as signal sequence cloning, which isolates DNA sequences 
based on the presence of a now well-recognized secretory leader sequence motif, as well as 
various PCR-based or low stringency hybridization-based cloning techniques, have advanced the 
state of the art by making available large numbers of DNA/amino acid sequences for proteins 
that are known to have biological activity, for example, by virtue of their secreted nature in the 
case of leader sequence cloning, by virtue of their cell or tissue source in the case of PCR-based 
techniques, or by virtue of structural similarity to other genes of known biological activity. 

The bone marrow is a well-organized tissue located within the central cavity of bone. It 
has a complex three-dimensional structure that is richly innervated and highly vascularized. Two 
primary cell types make up the structure of the bone marrow. These are the stromal, and 
parenchymal cells. Stromal cells include reticular cells such as fibroblasts, endothelial cells, 
adipocytes, as well as cells of the osteochondroge^ic lineage. They exert important influences 
on osteoclastogenesis and lymphopoiesis; and have additional effects on bone turnover. 
Parenchymal cells are comprised of the hematopoietic cells, and are important for proliferation, 
maturation, and migration of cells that make up the blood. 

In the adult, hematopoiesis takes place primarily in the bone marrow. Therefore, all of 
the cells that make up the blood, such as erythrocytes, platelets, basophils, natural killer cells, 
eosinophils, T- and B-lymphocytes, neutrophils, macrophages, and others, are produced in this 
structure. Each of these cells is derived from a common, self-renewing stem cell that 
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proliferates, and/or differentiates depending on regulatory molecules that are procJaced bytfie" 1 ^ 

stromal cells. Stromal cells are predominantly a mixture of fibroblasts, macrophage/dendritic 

lineage cells, epithelial cells, and endothelial cells. They influence the fate of hematopoietic 

cells through the secretion of soluble factors, cytokines, and the expression of membrane- 

5 anchored growth factors, and cell surface recognition molecules. 

Cytokines are necessary for normal hematopoiesis in the bone marrow, and provide a 

means of fine-tuning bone marrow function in response to stimulation. They are not only 

produced by stromal cells, but can also be secreted by macrophages, and antigen-stimulated T 

lymphocytes for the purpose of replenishing leukocytes that may be consumed during immune 

10 and inflammatory reactions. Many cytokines that influence the differentiation and expansion of 
hematopoietic progenitor cells are termed colony-stimulating factors, because they were initially 
assayed by their ability to stimulate the formation of cell colonies in bone marrow cultures. 
Some of these colony-stimulating factors (CSFs) include, granulocyte-CSF, 
granulocyte/macrophage-CSF, monocyte-CSF, Kit-Iigand, interleukin (IL)-6, FLK-2 ligand, and 

1 5 leukemia inhibitory factor. Each of these stimulates the growth and development of various 

leukocytic or erythroid colonies. Other cytokines secreted in the bone marrow include IL-9, a T 
cell line and mast cell progenitor-stimulating factor, IL-1 1, a megakaryocytopoiesis stimulator, 
and IL-7, a cytokine that influences the survival and expansion of immature precursors 
committed to the B and T cell lineages. Many other cytokines are also secreted in the bone 

20 marrow. 

Cell-surface molecules that represent several adhesion molecule superfamilies including 
integrins, selectins, sialomucins and the immunoglobulin domain-containing proteins, are 
important in supporting cell-cell and cell-extracellular matrix interactions in the bone marrow. 
These proteins are critical to the homing of progenitor cells selectively to the marrow stroma for 

25 proliferation and differentiation. They also serve to influence the fate of the progenitor cells by 
directing them to differentiate into a specific lineage. For example, VLA-4 directs control of late 
erythroid differentiation and pro-B cell maturation. 

The bone marrow is also the site of B cell development. B cells begin as lymphoid stem 
cells that differentiate into progenitor B-cells, or pro-B cells. Pro-B cells proliferate within the 

30 bone marrow, and fill the extravascular spaces between large sinusoids in the shaft of the bone. 
They next mature into precursor B cells, pre-B cells. The stromal cells of the bone marrow are 
crucial for both pro- and pre-B cell development because they provide a source of cytokines, and 
a substrate for direct interaction with the pro- and pre-B cells. Pro-B cells require interaction 
with VCAM-1 and stem-cell factor (SCF) on the stromal cells to induce expression of the IL-7 

35 receptor. Secretion of IL-7 by the stromal cells then induces the pro-B ceUs to mature into pre-B 
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cells. Continued IL-7 secretion by stromal cells induces pre-B cells to begin proliferating and 
eventually differentiates them into immature B-cells. In addition, a selection process within the 
bone marrow eliminates B cells with self-reactive phenotypes, functioning to protect against 
autoimmune disease. 

5 The bone marrow environment also influences bone turnover and bone precursor cell 

functions. Bone marrow stromal cells include the precursors of the osteochondrogenic lineage, 
and can modulate the effects of some systemic factors on bone turnover. Furthermore, 
hematopoietic cells may influence the differentiation of osteogenic cells, and mature 
lymphocytes may impact osteoclastic and osteoblastic functions. For instance, B-lymphocytes 

10 have been implicated in the secretion of factors that change the immunological milieu at sites of 
new bone induction and influence new bone formation. 

The identified bone marrow-expressed polynucleotide and polypeptide sequences may 
have applications in hematopoiesis, stem cell survival, and bone growth and remodeling. 
Identification of secreted factors that stimulate hematopoiesis may serve to produce greater 

1 5 immune responses in immunosuppressed individuals. The identification of factors that 
preferentially stimulate specific hematopoietic cell types may also allow the prevention of 
specific disorders such as anemia in the case erythroid cell stimulating factors, or platelet 
deficiency in the case of megakaryocyte stimulating factors. Likewise, stem cell stimulating 
factors may be used to restore blood cell populations following chemotherapy treatments for 

20 cancer. Therapy to stimulate bone healing and remodeling may also be identified by the 

discovery of novel factors in the bone marrow that influence bone resorption by osteoclasts, or 
new bone cell differentiation from stromal cells. 



2. SUMMARY OF THE INVENTION 

25 The compositions of the present invention include novel isolated polypeptides from bone 

marrow tissue, and novel isolated polynucleotides from bone marrow tissue encoding such 
polypeptides, including recombinant DN A molecules, cloned genes or degenerate variants thereof, 
especially naturally occurring variants such as allelic variants, antisense polynucleotide molecules, 
and antibodies that specifically recognize one or more epitopes present on such polypeptides, as 

30 well as hybridomas producing such antibodies. 

The compositions of the present invention additionally include vectors, including expression 
vectors, containing the polynucleotides of the invention, cells genetically engineered to contain such 
polynucleotides and cells genetically engineered to express such polynucleotides. 

The present invention relates to a collection or library of at least one novel nucleic acid 

35 sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by 
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hybridization (SBH), and in some cases, sequences obtained from one or more public databases. 

The invention relates also to the proteins encoded by such polynucleotides, along with therapeutic, 

diagnostic and research utilities for these polynucleotides and proteins. These nucleic acid 

sequences are designated as SEQ ID NO: 1-84, or 1 68-25 1 are provided in the Sequence Listing. In 

5 the nucleic acids provided in the Sequence Listing, A is adenine; C is cytosine; G is guanosine; T is 

thymine; and N is any of the four bases. In the amino acids provided in the Sequence Listing, * 

corresponds to the stop codon. 

The nucleic acid sequences of the present invention also include, nucleic acid sequences that 

hybridize to the complement of SEQ ID NO : 1-84, or 168-251 under stringent hybridization 

1 0 conditions; nucleic acid sequences which are allelic variants or species homologues of any of the 
nucleic acid sequences recited above, or nucleic acid sequences that encode a peptide comprising a 
specific domain or truncation of the peptides encoded by SEQ ID NO: 1 -84, or 1 68-25 1 . A 
polynucleotide comprising a nucleotide sequence having at least 90% identity to an identifying 
sequence of SEQ ID NO: 1 -84, or 1 68-25 1 or a degenerate variant or fragment thereof. The 

1 5 identifying sequence can be 1 00 base pairs in length. 

The nucleic acid sequences of the present invention also include the sequence information 
from the nucleic acid sequences of SEQ ID NO: 1 -84, or 1 68-25 1 . The sequence information can 
be a segment of any one of SEQ ID NO: 1 -84, or 1 68-25 1 that uniquely identifies or represents the 
sequence information of SEQ ID NO: 1 -84, or 1 68-25 1 . 

20 A collection as used in this application can be a collection of only one polynucleotide. The 

collection of sequence information or identifying information of each sequence can be provided on 
a nucleic acid array. In one embodiment, segments of sequence information are provided on a 
nucleic acid array to detect the polynucleotide that contains the segment. The array can be designed 
to detect full-match or mismatch to the polynucleotide that contains the segment. The collection 

25 can also be provided in a computer-readable format 

This invention also includes the reverse or direct complement of any of the nucleic acid 
sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and 
host cells or organisms transformed with these expression vectors. Nucleic acid sequences (or their 
reverse or direct complements) according to the invention have numerous applications in a variety 

30 of techniques known to those skilled in the art of molecular biology, such as use as hybridization 
probes, use as primers for PCR, use in an array, use in computer-readablemedia, use in sequencing 
full-length genes, use for chromosome and gene mapping, use in the recombinant production of 
protein, and use in the generation of anti-sense DN A or RNA, their chemical analogs and the like. 
In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1 -84, or 1 68-25 1 , or 

3 5 novel segments or parts of the nucleic acids of the invention are used as primers in expression 
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assays that are well known in the art. In a particularly preferred embodiment, the nucleic acid 

sequences of SEQ ID NO: 1 -84, or 1 68-251 or novel segments or parts of the nucleic acids provided 

herein are used in diagnostics for identifying bone marrow tissues and cells; for identifying 

expressed genes or, as well known in the art and exemplified by Vollrath et al., Science 258:52-59 

5 ( 1 992), as expressed sequence tags for physical mapping of the human genome. 

The isolated polynucleotides of the invention include, but are not limited to, a 

polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1 -84, or 

1 68-25 1 ; a polynucleotide comprising any of the full length protein coding sequences of SEQ ID 

NO: 1 -84, or 1 68-25 1 ; and a polynucleotide comprising any of the nucleotide sequences of the 

1 0 mature protein coding sequences of SEQ ID NO: 1 -84, or 1 68-25 1 . The polynucleotides of the 
present invention also include, but are not limited to, a polynucleotide that hybridizes under 
stringent hybridization conditions to (a) the complement of any one of the nucleotide sequences set 
forth in SEQ ID NO: 1-84, or 1 68-25 1 ; (b) a nucleotide sequence encoding any one of the amino 
acid sequences comprising SEQ ID NO: 85-1 67, or 252-335; (c) a polynucleotide which is an allelic 

1 5 variant of any polynucleotides recited above; (d) a polynucleotide which encodes a species homolog 
(e.g. orthologs) of any of the proteins recited above; or (e) a polynucleotide that encodes a 
polypeptide comprising a specific domain or truncation of any of the polypeptides comprising an 
amino acid sequence set forth in the Sequence Listing. 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 

20 comprising any of the amino acid sequences set forth in the Sequence Listing; or the corresponding 
full length or mature protein. Polypeptides of the invention also include polypeptides with 
biological activity that are encoded by (a) any of the polynucleotides having a nucleotide sequence 
set forth in SEQ ID NO: 1-84, or 168-251; or (b) polynucleotides that hybridize to the complement 
of the polynucleotides of (a) under stringent hybridization conditions, or (c) polypeptides 

25 comprising any of the polypeptide sequences set forth in SEQ ID NO: 85-1 67, or 252-335. 
Biologically or immunologically active variants of any of the polypeptide sequences in the 
Sequence Listing, and "substantial equivalents" thereof (e.g., with at least about 65%, 70%, 75%, 
80% y 85%, 90%, 95%, 98% or 99% amino acid sequence identity) that preferably retain biological 
activity are also contemplated. The polypeptides of the invention may be wholly or partially 

3 0 chemically synthesized but are preferably produced by recombinant means using the genetically . 
engineered cells (e.g. host cells) of the invention. The polypeptides may have the initial methionine 
(Met) removed. 

The invention also provides compositions comprising a polypeptide of the invention. 
Polypeptide compositions of the invention may further comprise an acceptable carrier, such as a 
35 hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 
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The invention also provides host cells transfonned or transfected with a polynucleotide of 

the invention. 

The invention also relates to methods for producing a polypeptide of the invention 
comprising growing a culture of the host cells of the invention in a suitable culture medium 
5 under conditions permitting expression of the desired polypeptide, and purifying the polypeptide 
from the culture or from the host cells. Preferred embodiments include those in which the 
protein produced by such process is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a variety of 
techniques known to those skilled in the art of molecular biology. These techniques include use 

10 as hybridization probes, use as oligomers, or primers, for PCR, use for chromosome and gene 
mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA 
or RNA, their chemical analogs and the like. For example, when the expression of an mRNA is 
largely restricted to a particular cell or tissue type, polynucleotides of the invention can be used 
as hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample 

1 5 using, e.g., in situ hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
expressed sequence tags for identifying expressed genes or, as well known in the art and 
exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for 
physical mapping of the human genome. 

20 The polypeptides according to the invention can be used in a variety of conventional 

procedures and methods that are currently applied to other proteins. For example, a polypeptide 
of the invention can be used to generate an antibody that specifically binds the polypeptide. Such 
antibodies, particularly monoclonal antibodies, are useful for detecting or quantitating the 
polypeptide in tissue. The polypeptides of the invention can also be used as molecular weight 

25 markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical condition 
which comprises the step of administering to a mammalian subject a therapeutically effective 
amount of a composition comprising a polypeptide of the present invention and a 
pharmaceutically acceptable carrier. 

30 In particular, the polypeptides and polynucleotides of the invention can be utilized, for 

example, in methods for the prevention and/or treatment of disorders involving aberrant protein 
expression or biological activity. 

The present invention further relates to methods for detecting the presence of the 
polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, be 

35 utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the 
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identification of subjects exhibiting a predisposition to such conditions. The invention provides 

a method for detecting the polynucleotides of the invention in a sample, comprising contacting 

the sample with a compound that binds to and forms a complex with the polynucleotide of 

interest for a period sufficient to form the complex and under conditions sufficient to form a 

5 complex and detecting the complex such that if a complex is detected, the polynucleotide of 

interest is detected. The invention also provides a method for detecting the polypeptides of the 

invention in a sample comprising contacting the sample with a compound that binds to and forms 

a complex with the polypeptide under conditions and for a period sufficient to form the complex 

and detecting the formation of the complex such that if a complex is formed, the polypeptide is 

10 detected. 

The invention also provides kits comprising polynucleotide probes and/or monoclonal 
antibodies, and optionally quantitative standards, for carrying out methods of the invention. 
Furthermore, the invention provides methods for evaluating the efficacy of drugs, and 
monitoring the progress of patients, involved in clinical trials for the treatment of disorders as 
15 recited above. 

The invention also provides methods for the identification of compounds that modulate 
(i.e., increase or decrease) the expression or activity of the polynucleotides and/or polypeptides 
of the invention. Such methods can be utilized, for example, for the identification of compounds 
that can ameliorate symptoms of disorders as recited herein. Such methods can include, but are 

20 not limited to, assays for identifying compounds and other substances that interact with (e.g. , 
bind to) the polypeptides of the invention. The invention provides a method for identifying a 
compound that binds to the polypeptides of the invention comprising contacting the compound 
with a polypeptide of the invention in a cell for a time sufficient to form a 
polypeptide/compound complex, wherein the complex drives expression of a reporter gene 

25 sequence in the cell; and detecting the complex by detecting the reporter gene sequence 

expression such that if expression of the reporter gene is detected the compound the binds to a 
polypeptide of the invention is identified. 

The methods of the invention also provide methods for treatment that involve the 
administration of the polynucleotides or polypeptides of the invention to individuals exhibiting 

30 symptoms or tendencies. In addition, the invention encompasses methods for treating diseases or 
disorders as recited herein comprising administering compounds and other substances that 
modulate the overall activity of the target gene products. Compounds and other substances can 
affect such modulation either on the level of target gene/protein expression or target protein 
activity. 
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The polypeptides of the present invention and the polynucleotides encoding them are also 

useful for the same functions known to one of skill in the art as the polypeptides and 

polynucleotides to which they have homology (set forth in Tables 1 A-D); for which they have a 

signature region (as set forth in Table 2); or for which they have homology to a gene family (as 

5 set forth in Tables 3). If no homology is set forth for a sequence, then the polypeptides and 

polynucleotides of the present invention are useful for a variety of applications, as described 

herein, including use in increasing hematopoiesis, stem cell survival, and bone growth and 

remodeling. 

3. DETAILED DESCRIPTION OF THE INVENTION 

10 

3.1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular forms "a", 
"an" and "the" include plural references unless the context clearly dictates otherwise. 

The term "active" refers to those forms of the polypeptide that retain the biologic and/or 
1 5 immunologic activities of any naturally occurring polypeptide. According to the invention, the 
terms "biologically active" or "biological activity" refer to a protein or peptide having structural, 
regulatory or biochemical functions of a naturally occurring molecule. Likewise 
"immunologically active" or "immunological activity" refers to the capability of the natural, 
recombinant or synthetic polypeptide to induce a specific immune response in appropriate 
20 animals or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application are those cells which are engaged in 
extracellular or intracellular membrane trafficking, including the export of secretory or 
enzymatic molecules as part of a normal or disease process. 

The terms "complementary" or "complementarity" refer to the natural binding of 
25 polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the 

complementary sequence 3'-TCA-5\ Complementarity between two single-stranded molecules 
may be "partial" such that only some of the nucleic acids bind or it may be "complete" such that 
total complementarity exists between the single stranded molecules. The degree of 
complementarity between the nucleic acid strands has significant effects on the efficiency and 
30 strength of the hybridization between the nucleic acid strands. 

The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 
differentiated cell types in an embryo or an adult, including the germ cells. The term "germ line 
stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a steady 
and continuous source of germ cells for the production of gametes. The term "primordial germ 
35 cells (PGCs)" refers to a small population of cells set aside from other cell lineages particularly 
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from the yolk sac, mesenteries, or gonadal ridges during embryogenesis that have the potential to 

differentiate into germ cells and other cells. PGCs are the source from which GSCs and ES cells 

are derived. The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells 

not only populate the germ line and give rise to a plurality of terminally differentiated cells that 

5 comprise the adult specialized organs, but are able to regenerate themselves. 

The term "expression modulating fragment," EMF, means a series of nucleotides that 

modulates the expression of an operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked 

sequence" when the expression of the sequence is altered by the presence of the EMF. EMFs 

10 include, but are not limited to, promoters, and promoter modulating sequences (inducible 
elements). One class of EMFs is nucleic acid fragments that induce the expression of an 
operably linked ORF in response to a specific regulatory factor or physiological event. 

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 
"oligonucleotide" are used interchangeably and refer to a heteropolymer of nucleotides or the 

1 5 sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or synthetic 
origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like material. It is 
contemplated that where the polynucleotide is RNA, the T (thymine) in the sequences provided 
herein is substituted with U (uracil). Generally, nucleic acid segments provided by this invention 

20 may be assembled from fragments of the genome and short oligonucleotide linkers, or from a 
series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic acid 
which is capable of being expressed in a recombinant transcriptional unit comprising regulatory 
elements derived from a microbial or viral operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or 

25 "segment" or "probe" or "primer" are used interchangeably and refer to a sequence of nucleotide 
residues which are at least about 5 nucleotides, more preferably at least about 7 nucleotides, 
more preferably at least about 9 nucleotides, more preferably at least about 1 1 nucleotides and 
most preferably at least about 17 nucleotides. The fragment is preferably less than about 500 
nucleotides, preferably less than about 200 nucleotides, more preferably less than about 1 00 

30 nucleotides, more preferably less than about 50 nucleotides and most preferably less than 30 
nucleotides. Preferably the probe is from about 6 nucleotides to about 200 nucleotides, 
preferably from about 15 to about 50 nucleotides, more preferably from about 17 to 30 
nucleotides and most preferably from about 20 to 25 nucleotides. Preferably the fragments can 
be used in polymerase chain reaction (PCR), various hybridization procedures or microarray 

3 5 procedures to identify or amplify identical or related parts of mRN A or DNA molecules. A 
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fragment or segment may uniquely identify each polynucleotide sequence of the present 

invention. Preferably the fragment comprises a sequence substantially similar to any one of SEQ 

ID NO: 1-84, or 168-251. 

Probes may, for example, be used to determine whether specific mRNA molecules are 

5 present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as 

described by Walsh et al. (Walsh, P.S. et aL, 1992, PCR Methods Appl 1 :241-250). They may 

be labeled by nick translation, Klenow fill-in reaction, PCR, or other methods well known in the 

art Probes of the present invention, their preparation and/or labeling are elaborated in 

Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 

10 Laboratory, NY; or Ausubel, F.M. et al, 1989, Current Protocols in Molecular Biology, John 

Wiley & Sons, New York NY, both of which are incorporated herein by reference in their 

entirety. 

The nucleic acid sequences of the present invention also include the sequence 
information from the nucleic acid sequences of SEQ ID NO: 1-84, or 1 68-25 1 . The sequence 

15 information can be a segment of any one of SEQ ID NO: 1-84, or 168-25 1 that uniquely 

identifies or represents the sequence information of that sequence of SEQ ID NO: 1-84, or 168- 
251. One such segment can be a twenty-mer nucleic acid sequence because the probability that a 
twenty-mer is fully matched in the human genome is 1 in 300. In the human genome, there are 
three billion base pairs in one set of chromosomes. Because 4 20 possible twenty-mers exist, there 

20 are 300 times more twenty-mers than there are base pairs in a set of human chromosome. Using 
the same analysis, the probability for a seventeen-mer to be frilly matched in the human genome 
is approximately 1 in 5. When these segments are used in arrays for expression studies, fifteen- 
mer segments can be used. The probability that the fifteen-mer is fully matched in the expressed 
sequences is also approximately one in five because expressed sequences comprise less than 

25 approximately 5% of the entire genome sequence. 

Similarly, when using sequence information for detecting a single mismatch, a segment can 
be a twenty-five men The probability that the twenty-five mer would appear in a human genome 
with a single mismatch is calculated by multiplying the probability for a frill match (1^4 25 ) times the 
increased probability for mismatch at each nucleotide position (3 x 25). The probability that an 

30 eighteen mer with a single mismatch can be detected in an array for expression studies is 
approximately one in five. The probability that a twenty-mer with a single mismatch can be 
detected in a human genome is approximately one in five. 

The term "open reading frame," ORF, means a series of nucleotide triplets coding for 
amino acids without any termination codons and is a sequence translatable into protein. 

10 
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The terms "operably linked" or "operably associated" refer to functionally related nucleic 

acid sequences. For example, a promoter is operably associated or operably linked with a coding 

sequence if the promoter controls the transcription of the coding sequence. While operably 

linked nucleic acid sequences can be contiguous and in the same reading frame, certain genetic 

5 elements e.g. repressor genes are not contiguously linked to the coding sequence but still control 

transcription/translation of the coding sequence. 

The term "pluripotent" refers to the capability of a cell to differentiate into a number of 

differentiated cell types that are present in an adult organism. A pluripotent cell is restricted in its 

differentiation capability in comparison to a totipotent cell. 

10 The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an oligopeptide, 

peptide, polypeptide or protein sequence or fragment thereof and to naturally occurring or 
synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a stretch of amino 
acid residues of at least about 5 amino acids, preferably at least about 7 amino acids, more 
preferably at least about 9 amino acids and most preferably at least about 17 or more amino 

15 acids. The peptide preferably is not greater than about 200 amino acids, more preferably less 
than 150 amino acids and most preferably less than 100 amino acids. Preferably the peptide is 
from about 5 to about 200 amino acids. To be active, any polypeptide must have sufficient 
length to display biological and/or immunological activity. 

The term "naturally occurring polypeptide" refers to polypeptides produced by cells that 

20 have not been genetically engineered and specifically contemplates various polypeptides arising 
from post-translational modifications of the polypeptide including, but not limited to, 
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 

The term "translated protein coding portion" means a sequence that encodes for the full- 
length protein which may include any leader sequence or any processing sequence. 

25 The term "mature protein coding sequence" means a sequence that encodes a peptide or 

protein without a signal or leader sequence. The "mature protein portion" means that portion of 
the protein which does not include a signal or leader sequence. The peptide may have been 
produced by processing in the cell that removes any leader/signal sequence. The mature protein 
portion may or may not include an initial methionine residue. The methionine residue may be 

30 removed from the protein during processing in the cell. The peptide may be produced 

synthetically or the protein may have been produced using a polynucleotide only encoding for 
the mature protein coding sequence. 

The term "derivative" refers to polypeptides chemically modified by such techniques as 
ubiquitination, labeling (e.g., with radionuclides or various enzymes), covalent polymer 

35 attachment such as pegylation (derivatization with polyethylene glycol) and insertion or 
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substitution by chemical synthesis of amino acids such as ornithine, which do not normally occur 

in human proteins. 

The term "variant ,f (or "analog") refers to any polypeptide differing from naturally 
occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, e g., 
recombinant DNA techniques. Guidance in determining which amino acid residues may be 
replaced, added or deleted without abolishing activities of interest, may be found by comparing 
the sequence of the particular polypeptide with that of homologous peptides and minimizing the 
number of amino acid sequence changes made in regions of high homology (conserved regions) 
or by replacing amino acids with consensus sequence. 

Alternatively, recombinant variants encoding these same or similar polypeptides may be 
synthesized or selected by making use of the "redundancy" in the genetic code. Various codon 
substitutions, such as the silent changes that produce various restriction sites, may be introduced 
to optimize cloning into a plasmid or viral vector or expression in a particular prokaryotic or 
eukaryotic system. Mutations in the polynucleotide sequence may be reflected in the 
polypeptide or domains of other peptides added to the polypeptide to modify the properties of 
any part of the polypeptide, to change characteristics such as Iigand-binding affinities, interchain 
affinities, or degradation/turnover rate. 

Preferably, amino acid "substitutions" are the result of replacing one amino acid with 
another amino acid having similar structural and/or chemical properties, le., conservative amino 
acid replacements. "Conservative" amino acid substitutions may be made on the basis of 
similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues involved. For example, nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar 
neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; and 
negatively charged (acidic) amino acids include aspartic acid and glutamic acid. "Insertions" or 
"deletions" are preferably in the range of about 1 to 20 amino acids, more preferably 1 to 10 
amino acids. The variation allowed may be experimentally determined by systematically making 
insertions, deletions, or substitutions of amino acids in a polypeptide molecule using 
recombinant DNA techniques and assaying the resulting recombinant variants for activity. ' 

Alternatively, where alteration of function is desired, insertions, deletions or 
non-conservative alterations can be engineered to produce altered polypeptides. Such alterations 
can, for example, alter one or more of the biological functions or biochemical characteristics of 
the polypeptides of the invention. For example, such alterations may change polypeptide 
characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover 
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rate. Further, such alterations can be selected so as to generate polypeptides that are better suited 

for expression, scale up and the like in the host cells chosen for expression. For example, 

cysteine residues can be deleted or substituted with another amino acid residue in order to 

eliminate disulfide bridges. 

5 The terms "purified" or "substantially purified" as used herein denotes that the indicated 

nucleic acid or polypeptide is present in the substantial absence of other biological 

macromolecules, e.g., polynucleotides, proteins, and the like. In one embodiment, the 

polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more 

preferably at least 99% by weight, of the indicated biological macromolecules present (but water, 

10 buffers, and other small molecules, especially molecules having a molecular weight of less than 
1000 Daltons, can be present). 

The term "isolated" as used herein refers to a nucleic acid or polypeptide separated from 
at least one other component (e.g., nucleic acid or polypeptide) present with the nucleic acid or 
polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide is found in 

1 5 the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a 
solution of the same. The terms "isolated" and "purified" do not encompass nucleic acids or 
polypeptides present in their natural source. 

The term "recombinant," when used herein to refer to a polypeptide or protein, means 
that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or mammalian) 

20 expression systems. "Microbial" refers to recombinant polypeptides or proteins made in 
bacterial or fungal (e.g., yeast) expression systems. As a product, "recombinant microbial" 
defines a polypeptide or protein essentially free of native endogenous substances and 
unaccompanied by associated native glycosylation. Polypeptides or proteins expressed in most 
bacterial cultures, e.g., E. coIi y will be free of glycosylation modifications; polypeptides or 

25 proteins expressed in yeast will have a glycosylation pattern in general different from those 
expressed in mammalian cells. 

The term "recombinant expression vehicle or vector" refers to a plasmid or phage or virus 
or vector, for expressing a polypeptide from a DN A (RNA) sequence. An expression vehicle can 
comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements 

30 having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural 
or coding sequence which is transcribed into mRNA and translated into protein, and (3) 
appropriate transcription initiation and tennination sequences. Structural units intended for use 
in yeast or eukaryotic expression systems preferably include a leader sequence enabling 
extracellular secretion of translated protein by a host cell. Alternatively, where recombinant 

35 protein is expressed without a leader or transport sequence, it may include an amino terminal 
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methionine residue. This residue may or may not be subsequently cleaved from the expressed 

recombinant protein to provide a final product. 

The term "recombinant expression system" means host cells that have stably integrated a 

recombinant transcriptional unit into chromosomal DNA or carry the recombinant transcriptional 

5 unit extrachromosomally. Recombinant expression systems as defined herein will express 

heterologous polypeptides or proteins upon induction of the regulatory elements linked to the 

DNA segment or synthetic gene to be expressed. This term also means host cells that have 

stably integrated a recombinant genetic element or elements having a regulatory role in gene 

expression, for example, promoters or enhancers. Recombinant expression systems as defined 

10 herein will express polypeptides or proteins endogenous to the cell upon induction of the 

regulatory elements linked to the endogenous DNA segment or gene to be expressed. The cells 
can be prokaryotic or eukaryotic. 

The term "secreted" includes a protein that is transported across or through a membrane, 
including transport as a result of signal sequences in its amino acid sequence when it is 

1 5 expressed in a suitable host cell. "Secreted" proteins include without limitation proteins secreted 
wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell in which they are 
expressed. "Secreted" proteins also include without limitation proteins that are transported 
across the membrane of the endoplasmic reticulum. "Secreted" proteins are also intended to 
include proteins containing non-typical signal sequences (e.g. Interleukin-1 Beta, see Krasney, 

20 P.A. and Young, P.R. (1992) Cytokine 4(2): 134 -143) and factors released from damaged cells 
(e.g. Interleukin-1 Receptor Antagonist, see Arend, W.P. et. al. (1998) Annu. Rev. Immunol. 
16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or leader 
sequence" which will direct the polypeptide through the membrane of a cell. Such a sequence 
25 may be naturally present on the polypeptides of the present invention or provided from 
heterologous protein sources by recombinant DNA techniques. 

The term "stringent" is used to refer to conditions that are commonly understood in the 
art as stringent. Stringent conditions can include highly stringent conditions (i.e., hybridization 
to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65 
30 °C, and washing in 0.1X SSC/0.1% SDS at 68 °C), and moderately stringent conditions (i.e., 
washing in 0.2X SSC/0.1% SDS at 42 °C). Other exemplary hybridization conditions are 
described herein in the examples. 

In instances of hybridization of deoxyoligonucleotides, additional exemplary stringent 
hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate at 37 °C (for 

14 



WO 01/74836 PCT/US01/10472 
14-base oligonucleotides), 48 °C (for 17-base oligos), 55 °C (for 20-base oligonucleotides), and 

60 °C (for23-base oligonucleotides). 

As used herein, "substantially equivalent" or "substantially similar" can refer both to 

nucleotide and amino acid sequences, for example a mutant sequence, that varies from a 

5 reference sequence by one or more substitutions, deletions, or additions, the net effect of which 

does not result in an adverse functional dissimilarity between the reference and subject 

sequences. Typically, such a substantially equivalent sequence varies from one of those listed 

herein by no more than about 35% (/.e., the number of individual residue substitutions, additions, 

and/or deletions in a substantially equivalent sequence, as compared to the corresponding 

1 0 reference sequence, divided by the total number of residues in the substantially equivalent 
sequence is about 0.35 or less). Such a sequence is said to have 65% sequence identity to the 
listed sequence. In one embodiment, a substantially equivalent, e.g., mutant, sequence of the 
invention varies from a listed sequence by no more than 30% (70% sequence identity); in a 
variation of this embodiment, by no more than 25% (75% sequence identity); and in a further 

1 5 variation of this embodiment, by no more than 20% (80% sequence identity) and in a further 
variation of this embodiment, by no more than 10% (90% sequence identity) and in a further 
variation of this embodiment, by no more that 5% (95% sequence identity). Substantially 
equivalent, e.g., mutant, amino acid sequences according to the invention preferably have at least 
80% sequence identity with a listed amino acid sequence, more preferably at least 85% sequence 

20 identity, more preferably at least 90% sequence identity, more preferably at least 95% sequence 
identity, more preferably at least 98% sequence identity, and most preferably at least 99% 
sequence identity. Substantially equivalent nucleotide sequence of the invention can have lower 
percent sequence identities, taking into account, for example, the redundancy or degeneracy of 
the genetic code. Preferably, the nucleotide sequence has at least about 65% identity, more 

25 preferably at least about 75% identity, more preferably at least about 80% sequence identity, 
more preferably at least 85% sequence identity, more preferably at least 90% sequence identity, 
more preferably at least about 95% sequence identity, more preferably at least 98% sequence 
identity, and most preferably at least 99% sequence identity. For the purposes of the present 
invention, sequences having substantially equivalent biological activity and substantially 

30 equivalent expression characteristics are considered substantially equivalent. For the purposes of 
determining equivalence, truncation of the mature sequence (e.g., via a mutation which creates a 
spurious stop codon) should be disregarded. Sequence identity may be determined, e.g., using 
the Jotun Hein method (Hein, J. (1990) Methods Enzymol. 183:626-645). Identity between 
sequences can also be determined by other methods known in the art, e.g. by varying 

35 hybridization conditions. 
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The term "totipotent" refers to the capability of a cell to differentiate into all of the cell 

types of an adult organism. 

The term "transformation" means introducing DNA into a suitable host cell so that the 
DNA is replicable, either as an extrachromosomal element, or by chromosomal integration. The 
term "transfection" refers to the taking up of an expression vector by a suitable host cell, whether 
or not any coding sequences are in fact expressed. The term "infection" refers to the introduction 
of nucleic acids into a suitable host cell by use of a virus or viral vector. 

As used herein, an "uptake modulating fragment," UMF, means a series of nucleotides 
that mediate the uptake of a linked DNA fragment into a cell. UMFs can be readily identified 
using known UMFs as a target sequence or target motif with the computer-based systems 
described below. The presence and activity of a UMF can be confirmed by attaching the 
suspected UMF to a marker sequence. The resulting nucleic acid molecule is then incubated 
with an appropriate host under appropriate conditions and the uptake of the marker sequence is 
determined. As described above, a UMF will increase the frequency of uptake of a linked 
marker sequence. 

Each of the above terms is meant to encompass all that is described for each, unless the 
context dictates otherwise. 
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3.2 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 
The isolated polynucleotides of the invention include a polynucleotide comprising the 
nucleotide sequences of SEQ ID NO: 1-84, or 168-251; a polynucleotide encoding any one of 
5 the peptide sequences of SEQ ID NO: 85-167, or 252-335; and a polynucleotide comprising the 
nucleotide sequence encoding the mature protein coding sequence of the polynucleotides of any 
one of SEQ ID NO: 1-84, or 168-251. The polynucleotides of the present invention also include, 
but are not limited to, a polynucleotide that hybridizes under stringent conditions to (a) the 
complement of any of the nucleotides sequences of SEQ ID NO: 1-84, or 168-251; (b) 

10 nucleotide sequences encoding any one of the amino acid sequences set forth in the Sequence 
Listing; (c) a polynucleotide which is an allelic variant of any polynucleotide recited above; (d) a 
polynucleotide which encodes a species homolog of any of the proteins recited above; or (e) a 
polynucleotide that encodes a polypeptide comprising a specific domain or truncation of the 
polypeptides of SEQ ID NO: 85-167, or 252-335. Domains of interest may depend on the nature 

15 of the encoded polypeptide; e.g., domains in receptor-like polypeptides include ligand-binding, 
extracellular, transmembrane, or cytoplasmic domains, or combinations thereof; domains in 
immunoglobulin-like proteins include the variable immunoglobulin-like domains; domains in 
enzyme-like polypeptides include catalytic and substrate binding domains; and domains in ligand 
polypeptides include receptor-binding domains. 

20 The polynucleotides of the invention include naturally occurring or wholly or partially 

synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The polynucleotides 
may include all of the coding region of the cDNA or may represent a portion of the coding 
region of the cDNA. 

The present invention also provides genes corresponding to the cDNA sequences disclosed 
25 herein. The corresponding genes can be isolated in accordance with known methods using the 

sequence information disclosed herein. Such methods include the preparation of probes or primers 
from the disclosed sequence information for identification and/or amplification of genes in 
appropriate genomic libraries or other sources of genomic materials. Further 5' and 3* sequence can 
be obtained using methods known in the art. For example, full length cDNA or genomic DNA that 
30 corresponds to any of the polynucleotides of SEQ ID NO: 1 -84, or 1 68-25 1 can be obtained by 
screening appropriate cDN A or genomic DNA libraries under suitable hybridization conditions 
using any of the polynucleotides of SEQ ID NO: 1 -84, or 1 68-25 1 or a portion thereof as a probe. 
Alternatively, the polynucleotides of SEQ ID NO: 1-84, or 168-251 may be used as the basis for 
suitable primer(s) that allow identification and/or amplification of genes in appropriate genomic 
35 DNA or cDNA libraries. 
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The nucleic acid sequences of the invention can be assembled from ESTs and sequences 

(including cDNA and genomic sequences) obtained from one or more public databases, such as 

dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence information, 

representative fragment or segment information, or novel segment information for the full-length 

5 gene. 

The polynucleotides of the invention also provide polynucleotides including nucleotide 
sequences that are substantially equivalent to the polynucleotides recited above. Polynucleotides 
according to the invention can have, e.g., at least about 65%, at least about 70%, at least about 
75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least about 85%, 86%, 87%, 

10 88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, and even more typically at 
least about 95%, 96%, 97%, 98%, 99% sequence identity to a polynucleotide recited above. 

Included within the scope of the nucleic acid sequences of the invention are nucleic acid 
sequence fragments that hybridize under stringent conditions to any of the nucleotide sequences 
of SEQ ID NO: 1-84, or 168-251 or complements thereof, which fragment is greater than about 5 

15 nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and most 

preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, or 20 nucleotides or more that 
are selective for (i.e. specifically hybridize to) any one of the polynucleotides of the invention 
are contemplated. Probes capable of specifically hybridizing to a polynucleotide can 
differentiate polynucleotide sequences of the invention from other polynucleotide sequences in 

20 the same family of genes or can differentiate human genes from genes of other species, and are 
preferably based on unique nucleotide sequences. 

The sequences falling within the scope of the present invention are not limited to these 
specific sequences, but also include allelic and species variations thereof. Allelic and species 
variations can be routinely determined by comparing the sequences provided in SEQ ID NO : 1 -84, 

25 or 1 68-25 1 , a representative fragment thereof, or a nucleotide sequence at least 90% identical, 

preferably 95% identical, to SEQ ID NO: 1 -84, or 1 68-251 with a sequence from another isolate of 
the same species. Furthermore, to accommodate codon variability, the invention includes nucleic 
acid molecules coding for the same amino acid sequences as do the specific ORFs disclosed herein. 
In other words, in the coding region of an ORF, substitution of one codon for another codon that 

30 encodes the same amino acid is expressly contemplated. 

The nearest neighbor or homology result for the nucleic acids of the present invention, 
including SEQ ID NO: 1 -84, or 1 68-25 1 , can be obtained by searching a database using an 
algorithm or a program. Preferably, a BLAST, which stands for Basic Local Alignment Search 
Tool, is used to search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 (1993) 
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and Altschul S.F. et al. J. Mol. Biol. 2 1 :403-41 0 ( 1 990)). Alternatively a FASTA version 3 search 
against Genpept, using Fastxy algorithm could also be used. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are also 
provided by the present invention. Species homologs may be isolated and identified by making 
suitable probes or primers from the sequences provided herein and screening a suitable nucleic 
acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides or 
proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide which also 
encode proteins which are identical, homologous or related to that encoded by the 
polynucleotides. 

The nucleic acid sequences of the invention are further directed to sequences that encode 
variants of the described nucleic acids. These amino acid sequence variants may be prepared by 
methods known in the art by introducing appropriate nucleotide changes into a native or variant 
polynucleotide. There are two variables in the construction of amino acid sequence variants: the 
location of the mutation and the nature of the mutation. Nucleic acids encoding the amino acid 
sequence variants are preferably constructed by mutating the polynucleotide to encode an amino 
acid sequence that does not occur in nature. These nucleic acid alterations can be made at sites 
that differ in the nucleic acids from different species (variable positions) or in highly conserved 
regions (constant regions). Sites at such locations will typically be modified in series, e.g., by 
substituting first with conservative choices (e.g., hydrophobic amino acid to a different 
hydrophobic amino acid) and then with more distant choices (e.g., hydrophobic amino acid to a 
charged amino acid), and then deletions or insertions may be made at the target site. Amino acid 
sequence deletions generally range from about 1 to 30 residues, preferably about 1 to 10 
residues, and are typically contiguous. Amino acid insertions include amino- and/or 
carboxyl-terminal fusions ranging in length from one to one hundred or more residues, as well as 
intrasequence insertions of single or multiple amino acid residues. Intrasequence insertions may 
range generally from about 1 to 10 amino residues, preferably from 1 to 5 residues. Examples of 
terminal insertions include the heterologous signal sequences necessary for secretion or for 
intracellular targeting in different host cells and sequences such as FLAG or poly-histidine 
sequences useful for purifying. the expressed protein. 

In a preferred method, polynucleotides encoding the novel amino acid sequences are 
changed via site-directed mutagenesis. This method uses oligonucleotide sequences to alter a 
polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent 
nucleotides on both sides of the changed amino acid to form a stable duplex on either side of the 
site of being changed. In general, the techniques of site-directed mutagenesis are well known to 
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those of skill in the art and this technique is exemplified by publications such as, Edelman et al., 

DNA 2:183 (1983). A versatile and efficient method for producing site-specific changes in a 

polynucleotide sequence was published by Zoller and Smith, Nucleic Acids Res. 1 0:6487-6500 

(1982). PCR may also be used to create amino acid sequence variants of the novel nucleic acids. 

5 When small amounts of template DNA are used as starting material, primer(s) that differs 

slightly in sequence from the corresponding region in the template DNA can generate the desired 

amino acid variant. PCR amplification results in a population of product DNA fragments that 

differ from the polynucleotide template encoding the polypeptide at the position specified by the 

primer. The product DNA fragments replace the corresponding region in the plasmid and this 

10 gives a polynucleotide encoding the desired amino acid variant. 

A further technique for generating amino acid variants is the cassette mutagenesis 
technique described in Wells et aL, Gene 34:315 (1985); and other mutagenesis techniques well 
known in the art, such as, for example, the techniques in Sambrook et al., supra, and Current 
Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of the genetic 

15 code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be used in the practice of the invention for the cloning and expression 
of these novel nucleic acids. Such DNA sequences include those that are capable of hybridizing 
to the appropriate novel nucleic acid sequence under stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention can be used 

20 to generate polynucleotides encoding chimeric or fusion proteins comprising one or more 
domains of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any of the 
polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, amplified, or 
synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well known 

25 to those of skill in the art and can include, for example, methods for determining hybridization 
conditions that can routinely isolate polynucleotides of the desired sequence identities. 

In accordance with the invention, polynucleotide sequences comprising the mature 
protein coding sequences corresponding to any one of SEQ ID NO: 1-84, or 168-251, or 
functional equivalents thereof, may be used to generate recombinant DNA molecules that direct 

30 the expression of that nucleic acid, or a functional equivalent thereof, in appropriate host cells. 
Also included are the cDNA inserts of any of the clones identified herein. 

A polynucleotide according to the invention can be joined to any of a variety of other 
nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al. 
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). Useful 

35 nucleotide sequences for joining to polynucleotides include an assortment of vectors, e.g., 
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plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the 

art. Accordingly, the invention also provides a vector including a polynucleotide of the 

invention and a host cell containing the polynucleotide. In general, the vector contains an origin 

of replication functional in at least one organism, convenient restriction endonuclease sites, and a 

selectable marker for the host cell. Vectors according to the invention include expression 

vectors, replication vectors, probe generation vectors, and sequencing vectors. A host cell 

according to the invention can be a prokaryotic or eukaryotic cell and can be a unicellular 

organism or part of a multicellular organism. 

The present invention further provides recombinant constructs comprising a nucleic acid 
having any of the nucleotide sequences of SEQ ID NO: 1-84, or 168-251or a fragment thereof or 
any other polynucleotides of the invention. In one embodiment, the recombinant constructs of 
the present invention comprise a vector, such as a plasmid or viral vector, into which a nucleic 
acid having any of the nucleotide sequences of SEQ ID NO: 1-84, or 168-25 lor a fragment 
thereof is inserted, in a forward or reverse orientation. In the case of a vector comprising one of 
the ORFs of the present invention, the vector may further comprise regulatory sequences, 
including for example, a promoter, operably linked to the ORF. Large numbers of suitable 
vectors and promoters are known to those of skill in the art and are commercially available for 
generating the recombinant constructs of the present invention. The following vectors are 
provided by way of example. Bacterial: pBS, phagescript, PsiX174, pBluescript SK, pBS KS, 
pNH8a, pNH16a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, 
pRIT5 (Pharmacia). Eukaryotic: pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) pSVK3, 
pBPV, pMSG, pSVL (Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an expression 
control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al, 
Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinantly. Many 
suitable expression control sequences are known in the art. General methods of expressing 
recombinant proteins are also known and are exemplified in R. Kaufman, Methods in 
Enzymology 185, 537-566 (1990). As defined herein "operably linked" means that the isolated 
polynucleotide of the invention and an expression control sequence are situated within a vector 
or cell in such a way that the protein is expressed by a host cell which has been transformed 
(transfected) with the ligated polynucleotide/expression control sequence. 

Promoter regions can be selected from any desired gene using CAT (chloramphenicol 
transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 
pKK232-8 and pCM7. Particular named bacterial promoters include lad, lacZ, T3, T7, gpt, 
lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine 
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kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of 

the appropriate vector and promoter is well within the level of ordinary skill in the art. 

Generally, recombinant expression vectors will include origins of replication and selectable 

markers permitting transformation of the host cell, e.g. , the ampicillin resistance gene of E coli 

5 and S. cerevisiae TKPl gene, and a promoter derived from a highly-expressed gene to direct 

transcription of a downstream structural sequence. Such promoters can be derived from operons 

encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), a-factor, acid 

phosphatase, or heat shock proteins, among others. The heterologous structural sequence is 

assembled in appropriate phase with translation initiation and termination sequences, and 

1 0 preferably, a leader sequence capable of directing secretion of translated protein into the 

periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a 
fusion protein including an amino terminal identification peptide imparting desired 
characteristics, e.g., stabilization or simplified purification of expressed recombinant product. 
Useful expression vectors for bacterial use are constructed by inserting a structural DNA 

1 5 sequence encoding a desired protein together with suitable translation initiation and termination 
signals in operable reading phase with a functional promoter. The vector will comprise one or 
more phenotypic selectable markers and an origin of replication to ensure maintenance of the 
vector and to, if desirable, provide amplification within the host. Suitable prokaryotic hosts for 
transformation include E. coli, Bacillus subtilis, Salmonella typhimurium and various species 

20 within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may also be 
employed as a matter of choice. 

As a representative but non-limiting example, useful expression vectors for bacterial use 
can comprise a selectable marker and bacterial origin of replication derived from commercially 
available plasmids comprising genetic elements of the well known cloning vector pBR322 

25 (ATCC 3701 7). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine 
Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the structural 
sequence to be expressed. Following transformation of a suitable host strain and growth of the 
host strain to an appropriate cell density, the selected promoter is induced or derepressed by 

30 appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an 
additional period. Cells are typically harvested by centrifugation, disrupted by physical or 
chemical means, and the resulting crude extract retained for further purification. 

Polynucleotides of the invention can also be used to induce immune responses. For 
example, as described in Fan et al., Nat Biotech. 17:870-872 (1999), incorporated herein by 

35 reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies 
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against the encoded polypeptide following topical administration of naked plasmid DNA or 

following injection, and preferably intramuscular injection of the DNA. The nucleic acid 

sequences are preferably inserted in a recombinant expression vector and may be in the form of 

naked DNA. 

5 

3.2.1 ANTISENSE 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that 
are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NO: 1-84, or 168-251, or fragments, analogs or derivatives thereof. An 

10 "antisense" nucleic acid comprises a nucleotide sequence that is complementary to a "sense" 
nucleic acid encoding a protein, e.g., complementary to the coding strand of a double-stranded 
cDNA molecule or complementary to an mRNA sequence. In specific aspects, antisense nucleic 
acid molecules are provided that comprise a sequence complementary to at least about 10, 25, 
50, 1 00, 250 or 500 nucleotides or an entire coding strand, or to only a portion thereof Nucleic 

15 acid molecules encoding fragments, homologs, derivatives and analogs of a protein of any of 
SEQ ID NO: 85-167, or 252-335 or antisense nucleic acids complementary to a nucleic acid 
sequence of SEQ ID NO: 1-84, or 168-251are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
of the coding strand of a nucleotide sequence of the invention. The term "coding region" refers 

20 to the region of the nucleotide sequence comprising codons which are translated into amino acid 
residues. In another embodiment, the antisense nucleic acid molecule is antisense to a 
"noncoding region" of the coding strand of a nucleotide sequence of the invention. The term 
"noncoding region" refers to 5 r and 3' sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions). 

25 Given the coding strand sequences encoding a nucleic acid disclosed herein [e.g., SEQ 

ID NO: 1-84, or 168-251), antisense nucleic acids of the invention can be designed according to 
the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule 
can be complementary to the entire coding region of a mRNA, but more preferably is an 
oligonucleotide that is antisense to only a portion of the coding or noncoding region of a mRNA. 

30 For example, the antisense oligonucleotide can be complementary to the region surrounding the 
translation start site of a mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 
15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention 
can be constructed using chemical synthesis or enzymatic ligation reactions using procedures 
known in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can 

35 be chemically synthesized using naturally occurring nucleotides or variously modified 

23 



WO 01/74836 PCT/US01/10472 
nucleotides designed to increase the biological stability of the molecules or to increase the 

physical stability of the duplex formed between the antisense and sense nucleic acids, e.g., 

phosphorothioate derivatives and acridine substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense nucleic acid 

5 include: 5-fluorouraciI, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 

4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 

2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 

inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 

2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 

10 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-manriosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 

15 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
described further in the following subsection). 

20 The antisense nucleic acid molecules of the invention are typically administered to a 

subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding a protein according to the invention to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of 

25 an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then administered 
systemically. For example, for systemic administration, antisense molecules can be modified 

30 such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g., 
by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface 
receptors or antigens. The antisense nucleic acid molecules can also' be delivered to cells using 
the vectors described herein. To achieve sufficient intracellular concentrations of antisense 
molecules, vector constructs in which the antisense nucleic acid molecule is placed under the 

3 5 control of a strong pol II or pol III promoter are preferred. 
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In yet another embodiment, the antisense nucleic acid molecule of the invention is an 

a-anomeric nucleic acid molecule. An cc-anomeric nucleic acid molecule forms specific 

double-stranded hybrids with complementary RNA in which, contrary to the usual p-units, the 

strands run parallel to each other (Gaultier et al (1 987) Nucleic Acids Res 15: 6625-6641 ). The 

5 antisense nucleic acid molecule can also comprise a 2'-o-methyIribonucIeotide (Inoue et al 

(1987) Nucleic Acids Res 15: 6131-6148) or a chimeric RNA -DNA analogue (Inoue et al (1987) 

FEBSLett 21 5: 327-330). 

3.2.2 RIBOZYMES AND PNA MOIETIES 

10 In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. 

Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a 
single-stranded nucleic acid, such as a mRNA, to which they have a complementary region. 
Thus, ribozymes (e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988) 
Nature 334:585-591)) can be used to catalytically cleave a mRNA transcripts to thereby inhibit 

15 translation of a mRNA. A ribozyme having specificity for a nucleic acid of the invention can be 
designed based upon the nucleotide sequence of a DNA disclosed herein (i.e., SEQ ID NO: 1-84, 
or 168-251). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in 
which the nucleotide sequence of the active site is complementary to the nucleotide sequence to 
be cleaved in an mRNA of SEQ ID NO: 1-84, or 168-251 (see, e.g., Cech etal U.S. Pat. No. 

20 4,987,071; and Cech et al U.S. Pat. No. 5,1 16,742). Alternatively, polynucleotides of the 
invention can be used to select a catalytic RNA having a specific ribonuclease activity from a 
pool of RNA molecules. See, e.g., Bartel et al, (1993) Science 261:141 1-1418. 

Alternatively, gene expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple helical 

25 structures that prevent transcription of the gene in target cells. See generally, Helene. (1 991) 
Anticancer Drug Des. 6: 569-84; Helene. etal (1992) Ann. NY. Acad Set 660:27-36; and 
Maher (1992) Bioassays 14: 807-15. 

In various embodiments, the nucleic acids of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or 

30 solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic 
acids can be modified to generate peptide nucleic acids (see Hyrup et al (1996) Bioorg Med 
Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid 
mimics, e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 

35 backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
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conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 

standard solid phase peptide synthesis protocols as described in Hyrup et aL (1996) above; 

Perry-O'Keefe et aL (1996) PNAS 93: 14670-675. 

PNAs of the invention can be used in therapeutic and diagnostic applications. For 

5 example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of 

gene expression by, e.g., inducing transcription or translation arrest or inhibiting replication. 

PNAs of the invention can also be used, e.g. , in the analysis of single base pair mutations in a 

gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in 

combination with other enzymes, e.g., SI nucleases (Hyrup B. (1996) above); or as probes or 

10 primers for DNA sequence and hybridization (Hyrup et aL (1996), above; Perry-O'Keefe (1996), 

above). 

In another embodiment, PNAs of the invention can be modified, e.g., to enhance their 
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 

1 5 delivery known in the art. For example, PNA-DNA chimeras can be generated that may 

combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition 
enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA 
portion would provide high binding affinity and specificity. PNA-DNA chimeras can be linked 
using linkers of appropriate lengths selected in terms of base stacking, number of bonds between 

20 the nucleobases, and orientation (Hyrup (1996) above). The synthesis of PNA-DNA chimeras 
can be performed as described in Hyrup (1996) above and Finn et aL (1996) Nucl Acids Res 24: 
3357-63. For example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 
5-(4-methoxytrityl)amino-5 -deoxy-thymidine phosphoramidite, can be used between the PNA 

25 and the 5' end of DNA (Mag et aL (1989) Nucl Acid Res 1 7: 5973-88). PNA monomers are then 
coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' 
DNA segment (Finn et aL (1996) above). Alternatively, chimeric molecules can be synthesized 
with a 5' DNA segment and a 3' PNA segment. See, Petersen et aL (1975) BioorgMed Chem 
LettS: 1119-11124. 

30 In other embodiments, the oligonucleotide may include other appended groups such as 

peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the 
cell membrane (see, e.g., Letsinger et aL, 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; 
Lemaitre et aL, 1 987, Proc. Natl Acad Sci. 84:648-652; PCT Publication No. W088/09810) or 
the blood-brain barrier (see, e.g., PCT Publication No. W089/1 0134). In addition, 

35 oligonucleotides can be modified with hybridization triggered cleavage agents (See, e.g., Krol et 
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al., 1988, BioTecfmiques 6:958-976) or intercalating agents. (See, e.g., Zon, 1988, Pharm. Res. 

5: 539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
cleavage agent, etc. 



3.3 HOSTS 

The present invention further provides host cells genetically engineered to contain the 
polynucleotides of the invention. For example, such host cells may contain nucleic acids of the 
invention introduced into the host cell using known transformation, transfection or infection 
methods. The present invention still further provides host cells genetically engineered to express 
the polynucleotides of the invention, wherein such polynucleotides are in operative association 
with a regulatory sequence heterologous to the host cell that drives expression of the 
polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, or 
increase, expression of endogenous polypeptide. Cells can be modified (e.g., by homologous 
recombination) to provide increased polypeptide expression by replacing, in whole or in part, the 
naturally occurring promoter witb/all or part of a heterologous promoter so that the cells express 
the polypeptide at higher levels. The heterologous promoter is inserted in such a manner that it 
is operatively linked to the encoding sequences. See, for example, PCT International Publication 
No. WO94/12650, PCT International Publication No. WO92/20808, and PCT International 
Publication No. WO91/09955. It is also contemplated that, in addition to heterologous promoter 
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which 
encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the coding 
sequence, amplification of the marker DNA by standard selection methods results in co- 
amplification of the desired protein coding sequences in the cells. 

The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower 
eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the recombinant construct into the host cell can be effected by 
calcium phosphate transfection, DEAE, dextran mediated transfection, or electroporation (Davis, 
L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one of the 
polynucleotides of the invention, can be used in conventional manners to produce the gene 
product encoded by the isolated fragment (in the case of an ORF) or can be used to produce a 
heterologous protein under the control of the EMF. 
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Any host/vector system can be used to express one or more of the ORFs of the present 
invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, Cv-1 cell, 
COS cells, 293 cells, and Sf9 cells, as well as prokaiyotic host such as R coli and B. subtilis. 
The most preferred cells are those which do not normally express the particular polypeptide or 
protein or which expresses the polypeptide or protein at low natural level. Mature proteins can 
be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate 
promoters. Cell-free translation systems can also be employed to produce such proteins using . 
RNAs derived from the DNA constructs of the present invention. Appropriate cloning and 
expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et 
al., in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New 
York (1989), the disclosure of which is hereby incorporated by reference. 

Various mammalian cell culture systems can also be employed to express recombinant 
protein. Examples of mammalian expression systems include the COS-7 lines of monkey kidney 
fibroblasts; described by Gluzman, Cell 23:175 (1981). Other cell lines capable of expressing a 
compatible vector are, for example, the CI 27, monkey COS cells, Chinese Hamster Ovary 
(CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 cells, 3T3 
cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell strains derived 
from in vitro culture of primary tissue, primary explants, HeLa cells, mouse L cells, BHK, 
HL-60, U937, HaK or Jurkat cells. Mammalian expression vectors will comprise an origin of 
replication, a suitable promoter and also any necessary ribosome binding sites, polyadenylation 
site, splice donor and acceptor sites, transcriptional termination sequences, and 5' flanking 
nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for example, 
SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide 
the required nontranscribed genetic elements. Recombinant polypeptides and proteins produced 
in bacterial culture are usually isolated by initial extraction from cell pdkts, followed by one or 
more salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein 
refolding steps can be used, as necessary, in completing configuration of the mature protein. 
Finally, high perfoimance liquid chromatography (HPLC) can be employed for final purification 
steps. Microbial cells employed in expression of proteins can be disrupted by any convenient 
method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing 
agents. 

Alternatively, it may be possible to produce the protein in lower eukaryotes such as yeast 
or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or 
any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 
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strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 

strain capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, it 

may be necessary to modify the protein produced therein, for example by phosphorylation or 

glycosylation of the appropriate sites, in order to obtain the functional protein. Such covalent 

attachments may be accomplished using known chemical or enzymatic methods. 

In another embodiment of the present invention, cells and tissues may be engineered to 

express an endogenous gene comprising the polynucleotides of the invention under the control of 

inducible regulatory elements, in which case the regulatory sequences of the endogenous gene 

may be replaced by homologous recombination. As described herein, gene targeting can be used 

to replace a gene's existing regulatory region with a regulatory sequence isolated from a different 

gene or a novel regulatory sequence synthesized by genetic engineering methods. Such 

regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, 

negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or 

combinations of said sequences. Alternatively, sequences that affect the structure or stability of 

the RNA or protein produced may be replaced, removed, added, or otherwise modified by 

targeting. These sequences include polyadenylation signals, mRNA stability elements, splice 

sites, leader sequences for enhancing or modifying transport or secretion properties of the 

protein, or other sequences that alter or improve the function or stability of protein or RNA 

molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the 
gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion 
of a regulatory element, such as the deletion of a tissue-specific negative regulatory element. 
Alternatively, the targeting event may replace an existing element; for example, an enhancer that 
has broader or different cell-type specificity than the naturally occurring elements can replace a 
tissue-specific enhancer. Here, the naturally occurring sequences are deleted and new sequences 
are added. In all cases, the identification of the targeting event may be facilitated by the use of 
one or more selectable marker genes that are contiguous with the targeting DNA, allowing for 
the selection of cells in which the exogenous DNA has integrated into the host cell genome. The 
identification of the targeting event may also be facilitated by the use of one or more marker 
genes exhibiting the property of negative selection, such that the negatively selectable marker is 
linked to the exogenous DNA, but configured such that the negatively selectable marker flanks 
the targeting sequence, and such that a correct homologous recombination event with sequences 
in the host cell genome does not result in the stable integration of the negatively selectable 
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marker. Markers useful for this purpose include the Herpes Simplex Virus thymidine kinase 

(TK) gene or the bacterial xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques, which can be used in accordance with 

this aspect of the invention, are more particularly described in U.S. Patent No. 5,272,071 to 

5 Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. 

PCTYUS92/09627 (WO93/09222) by Selden et al.; and International Application No. 

PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by reference 

herein in its entirety. 

10 3.4 POLYPEPTIDES OF THE INVENTION 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 
comprising: the amino acid sequences set forth as any one of SEQ ID NO: 85-167, or 252-335 or 
an amino acid sequence encoded by any one of the nucleotide sequences SEQ ID NO: 1-84, or 
168-251 or the corresponding full length or mature protein. Polypeptides of the invention also 

15 include polypeptides preferably with biological or immunological activity that are encoded by: 
(a) a polynucleotide having any one of the nucleotide sequences set forth in SEQ ID NO: 1-84, 
or 168-25 lor (b) polynucleotides encoding any one of the amino acid sequences set forth as SEQ 
ID NO: 85-167, or 252-335 or (c) polynucleotides that hybridize to the complement of the 
polynucleotides of either (a) or (b) under stringent hybridization conditions. The invention also 

20 provides biologically active or immunologically active variants of any of the amino acid 
sequences set forth as SEQ ID NO: 85-167, or 252-335 or the corresponding full length or 
mature protein; and "substantial equivalents" thereof (e.g., with at least about 65%, at least 
about 70%, at least about 75%, at least about 80%, at least about 85%, 86%, 87%, 88%, 89%, at 
least about 90%, 91%, 92%, 93%, 94%, typically at least about 95%, 96%, 97%, more typically 

25 at least about 98%, or most typically at least about 99% amino acid identity) that retain 

biological activity. Polypeptides encoded by allelic variants may have a similar, increased, or 
decreased activity compared to polypeptides comprising SEQ ID NO: 85-167, or 252-335. 

Fragments of the proteins of the present invention that are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the protein may 

30 be in linear form or they may be cyclized using known methods, for example, as described in H. 
U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. McDowell, et al., J. Amer. 
Chem. Soc. 1 14, 9245-9253 (1992), both of which are incorporated herein by reference. Such 
fragments may be fused to carrier molecules such as immunoglobulins for many purposes, 
including increasing the valency of protein binding sites. 
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The present invention also provides both fuil-Iength and mature forms (for example, 

without a signal sequence or precursor sequence) of the disclosed proteins. The protein coding 

sequence is identified in the sequence listing by translation of the disclosed nucleotide 

sequences. The mature form of such protein may be obtained by expression of a full-length 

5 polynucleotide in a suitable mammalian cell or other host cell. The sequence of the mature form 

of the protein is also determinable from the amino acid sequence of the full-length form. Where 

proteins of the present invention are membrane bound, soluble forms of the proteins are also 

provided. In such forms, part or all of the regions causing the proteins to be membrane bound 

are deleted so that the proteins are fully secreted from the cell in which it is expressed. 

10 Protein compositions of the present invention may further comprise an acceptable carrier, 

such as a hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 

The present invention further provides isolated polypeptides encoded by the nucleic acid 
fragments of the present invention or by degenerate variants of the nucleic acid fragments of the 
present invention. By "degenerate variant" is intended nucleotide fragments that differ from a 

1 5 nucleic acid fragment of the present invention (e.g. , an ORF) by nucleotide sequence but, due to 
the degeneracy of the genetic code, encode an identical polypeptide sequence. Preferred nucleic 
acid fragments of the present invention are the ORFs that encode proteins. 

A variety of methodologies known in the art can be utilized to obtain any one of the 
isolated polypeptides or proteins of the present invention. At the simplest level, the amino acid 

20 sequence can be synthesized using commercially available peptide synthesizers. The 

synthetically-constructed protein sequences, by virtue of sharing primary, secondary or tertiary 
structural and/or conformational characteristics with proteins may possess biological properties 
in common therewith, including protein activity. This technique is particularly useful in 
producing small peptides and fragments of larger polypeptides. Fragments are useful, for 

25 example, in generating antibodies against the native polypeptide. Thus, they may be employed 
as biologically active or immunological substitutes for natural, purified proteins in screening of 
therapeutic compounds and in immunological processes for the development of antibodies. 

The polypeptides and proteins of the present invention can alternatively be purified from 
cells that have been altered to express the desired polypeptide or protein. As used herein, a cell 

30 is said to be altered to express a desired polypeptide or protein when the cell, through genetic 
manipulation, is made to produce a polypeptide or protein which it normally does not produce or 
which the cell normally produces at a lower level. One skilled in the art can readily adapt 
procedures for introducing and expressing either recombinant or synthetic sequences into 
eukaryotic or prokaryotic cells in order to generate a cell that produces one of the polypeptides or 

35 proteins of the present invention. 
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The invention also relates to methods for producing a polypeptide comprising growing a 

culture of host cells of the invention in a suitable culture medium, and purifying the protein from 
the cells or the culture in which the cells are grown. For example, the methods of the invention 
include a process for producing a polypeptide in which a host cell containing a suitable 
expression vector that includes a polynucleotide of the invention is cultured under conditions that 
allow expression of the encoded polypeptide. The polypeptide can be recovered from the 
culture, conveniently from the culture medium, or from a lysate prepared from the host cells and 
further purified. Preferred embodiments include those in which the protein produced by such 
process is a full length or mature form of the protein. 

In an alternative method, the polypeptide or protein is purified from bacterial ceils that 
naturally produce the polypeptide or protein. One skilled in the art can readily follow known 
methods for isolating polypeptides and proteins in order to obtain one of the isolated 
polypeptides or proteins of the present invention. These include, but are not limited to, 
immunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, 
and immuno-affinity chromatography. See, e.g., Scopes, Protein Purification: Principles and 
Practice, Springer-Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory 
Manual; Ausubel et al., Current Protocols in Molecular Biology. Polypeptide fragments that 
retain biological/immunological activity include fragments comprising greater than about 100 
amino acids, or greater than about 200 amino acids, and fragments that encode specific protein 
domains. 

The purified polypeptides can be used in in vitro binding assays that are well known in 
the art to identify molecules that bind to the polypeptides. These molecules include but are not 
limited to, for e.g., small molecules, molecules from combinatorial libraries, antibodies or other 
proteins. The molecules identified in the binding assay are then tested for antagonist or agonist 
activity in in vivo tissue culture or animal models that are well known in the art. In brief, the 
molecules are titrated into a plurality of cell cultures or animals and then tested for either 
cell/animal death or prolonged survival of the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the peptides 
may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to 
cells. The toxin-binding molecule complex is then targeted to a tumor or other cell by the 
specificity of the binding molecule for SEQ ID NO: 85-167, or 252-335. 

The protein of the invention may also be expressed as a product of transgenic animals, 
e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized 
by somatic or germ cells containing a nucleotide sequence encoding the protein. 
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The proteins provided herein also include proteins characterized by amino acid sequences 

similar to those of purified proteins but into which modification are naturally provided or 

deliberately engineered. For example, modifications, in the peptide or DNA sequence, can be 

made by those skilled in the art using known techniques. Modifications of interest in the protein 

5 sequences may include the alteration, substitution, replacement, insertion or deletion of a 

selected amino acid residue in the coding sequence. For example, one or more of the cysteine 

residues may be deleted or replaced with another amino acid to alter the conformation of the 

molecule. Techniques for such alteration, substitution, replacement, insertion or deletion are 

well known to those skilled in the art (see, e.g., U.S. Pat. No. 4,518,584). Preferably, such 

10 alteration, substitution, replacement, insertion or deletion retains the desired activity of the 

protein. Regions of the protein that are important for the protein function can be determined by 
various methods known in the art including the alanine-scanning method which involved 
systematic substitution of single or strings of amino acids with alanine, followed by testing the 
resulting danine-containing variant for biological activity. This type of analysis determines.the 

1 5 importance of the substituted amino acid(s) in biological activity. Regions of the protein that are 
important for protein function may be determined by the eMATRIX program. 

Other fragments and derivatives of the sequences of proteins which would be expected to 
retain protein activity in whole or in part and are useful for screening or other immunological 
methodologies may also be easily made by those skilled in the art given the disclosures herein. 

20 Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of the 
invention to suitable control sequences in one or more insect expression vectors, and employing 
an insect expression system. Materials and methods for baculovirus/insect cell expression 
systems are commercially available in kit form from, e.g., Invitrogen, San Diego, Calif, U.S.A. 

25 (the MaxBat™ kit), and such methods are well known in the art, as described in Summers and 
Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), incorporated herein by 
reference. As used herein, an insect cell capable of expressing a polynucleotide of the present 
invention is "transformed." 

The protein of the invention may be prepared by culturing transformed host cells under 

30 culture conditions suitable to express the recombinant protein. The resulting expressed protein 
may then be purified from such culture (i.e., from culture medium or cell extracts) using known 
purification processes, such as gel filtration and ion exchange chromatography. The purification 
of the protein may also include an affinity column containing agents which will bind to the 
protein; one or more column steps over such affinity resins as concanavalin A-agarose, 

35 heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™; one or more steps involving 
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hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl 
ether; or immunoaffinity chromatography. 

Alternatively, the protein of the invention may also be expressed in a form, which will 
facilitate purification. For example, it may be expressed as a fusion protein, such as those of 
5 maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as a 
His tag. Kits for expression and purification of such fusion proteins are commercially available 
from New England BioLabs (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and Invitrogen, 
respectively. The protein can also be tagged with an epitope and subsequently purified by using 
a specific antibody directed to such epitope. One such epitope ("FLAG®") is commercially 
1 0 available from Kodak (New Haven, Conn.). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- HPLC) 
steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other 
aliphatic groups can be employed to further purify the protein. Some or all of the foregoing 
purification steps, in various combinations, can also be employed to provide a substantially 
1 5 homogeneous isolated recombinant protein. The protein thus purified is substantially free of 

other mammalian proteins and is defined in accordance with the present invention as an ''isolated 
protein." 

The polypeptides of the invention include analogs (variants). This embraces fragments, 
as well as peptides in which one or more amino acids has been deleted, inserted, or substituted. 

20 Also, analogs of the polypeptides of the invention embrace fusions of the polypeptides or 

modifications of the polypeptides of the invention, wherein the polypeptide or analog is fused to 
another moiety or moieties, e.g., targeting moiety or another therapeutic agent. Such analogs 
may exhibit improved properties such as activity and/or stability. Examples of moieties which 
may be fused to the polypeptide or an analog include, for example, targeting moieties which 

25 provide for the delivery of polypeptide to pancreatic cells, e.g., antibodies to pancreatic cells, 

antibodies to immune cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well 
as receptor and ligands expressed on pancreatic or immune cells. Other moieties, which may be 
fused to the polypeptide, include therapeutic agents that are used for treatment, for example, 
immunosuppressive drugs such as cyclosporin, SK506, azathioprine, CD3 antibodies and 

30 steroids. Also, polypeptides may be fused to immune modulators, and other cytokines such as 
alpha or beta interferon. 

3.4.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE IDENTITY 
AND SIMILARITY 
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Preferred identity and/or similarity are designed to give the largest match between the 

sequences tested. Methods to determine identity and similarity are codified in computer 

programs including, but are not limited to, the GCG program package, including GAP 

(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, 

5 University of Wisconsin, Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, S.F. 

et al., J. Mol. Biol. 215:403-410 (1990), PSI-BLAST (Altschul S.F. et ah, Nucleic Acids Res. 

vol. 25, pp. 3389-3402, herein incorporated by reference), eMATRIX software (Wu et al., J. 

Comp. Biol., Vol. 6, pp. 219-235 (1999), herein incoiporated by reference), eMotif software 

(NeviU-Manning et al, ISMB-97, Vol. 4, pp. 202-209, herein incorporated by reference), PFam 

1 0 software (Sonnhammer et al., Nucleic Acids Res., Vol. 26(1), pp. 320-322 (1998), herein 

incorporated by reference), SignalP software package (Nielsen H et al., Int. J. Neural Syst, Vol. 
8, pp. 581 - 599 (1997), herein incorporated by reference) and the Kyte-Doolittle 
hydrophobocity prediction algorithm (J. Mol. Biol, 157, pp. 105-31 (1982), incorporated herein 
by reference). The BLAST programs are publicly available from the National Center for 

15 Biotechnology Information (NCBI) and other sources (BLAST Manual, Altschul, S. s et al. NCB 
NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. Biol. 215:403-410 (1990). 

3.4.2 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a "chimeric 
protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to 

20 another polypeptide. Within a fusion protein the polypeptide according to the invention can 
correspond to all or a portion of a protein according to the invention. In one embodiment, a 
fusion protein comprises at least one biologically active portion of a protein according to the 
invention. In another embodiment, a fusion protein comprises at least two biologically active 
portions of a protein according to the invention. Within the fusion protein, the term "operatively 

25 linked" is intended to indicate that the polypeptide(s) according to the invention and the other 
polypeptide(s) are fused in-frame to each other. The polypeptide can be fused to the N-terminus 
or C-terminus or in the middle. 

For example, in one embodiment a fusion protein comprises a polypeptide according to 
the invention operably linked to the extracellular domain of a second protein. 

30 In another embodiment, the fusion protein is a GST-fusion protein in which the 

polypeptide sequences of the invention are fused to the C-terminus of the GST (i.e., glutathione 
S-transferase) sequences. 

In another embodiment, the fusion protein is an immunoglobulin fusion protein in which 
the polypeptide sequences according to the invention comprise one or more domains fused to 

35 sequences derived from a member of the immunoglobulin protein family. The immunoglobulin 
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fusion proteins of the invention can be incorporated into pharmaceutical compositions and 

administered to a subject to inhibit an interaction between a ligand and a protein of the invention 

on the surface of a cell, to thereby suppress signal transduction in vivo. The immunoglobulin 

fusion proteins can be used to affect the bioavailability of a cognate ligand. Inhibition of the 

5 ligand/protein interaction may be useful therapeutically for both the treatment of proliferative 

and differentiative disorders, e,g. f cancer as well as modulating (e.g., promoting or inhibiting) 

cell survival. Moreover, the immunoglobulin fusion proteins of the invention can be used to 

bind and to dimerize 2 receptors and thereby transduce an intracellular signal. The 

immunoglobulin fusion proteins may also be used as immunogens to produce antibodies in a 

1 0 subject, to purify ligands, and in screening assays to identify molecules that inhibit the 
interaction of a polypeptide of the invention with a ligand. 

A chimeric or fusion protein of the invention can be produced by standard recombinant 
DNA techniques. For example, DNA fragments coding for the different polypeptide sequences 
are ligated together in-frame in accordance with conventional techniques, e.g., by employing 

15 blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to 
avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can 
be synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively, PCR amplification of gene fragments can be carried out using anchor primers that 

20 give rise to complementary overhangs between two consecutive gene fragments that can 
subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for 
example, Ausubel et al. (eds.) Current Protocols in Molecular Biology, John Wiley & 
Sons, 1992). Moreover, many expression vectors are commercially available that already encode 
a fusion moiety (e.g., a GST polypeptide). A nucleic acid encoding a polypeptide of the 

25 invention can be cloned into such an expression vector such that the fusion moiety is linked 
in-frame to the protein of the invention. 

3.5 GENE THERAPY 

Mutations in the polynucleotides of the invention may result in loss of normal function of 
30 the encoded protein. The invention thus provides gene therapy to restore normal activity of the 
polypeptides of the invention; or to treat disease states involving polypeptides of the invention. 
Delivery of a functional gene encoding polypeptides of the invention to appropriate cells is 
effected ex v/vo, in situ, or in vivo by use of vectors, and more particularly viral vectors (e.g., 
adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of physical DNA transfer 
35 methods (e.g., liposomes or chemical treatments). See, for example, Anderson, Nature, 
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supplement to vol. 392, no. 6679, pp.25-20 (1998). For additional reviews of gene therapy 

technology see Friedmann, Science, 244: 1275-1281 (1989); Veima, Scientific American: 68-84 

(1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any one of the nucleotides of 

the present invention or a gene encoding the polypeptides of the present invention can also be 
5 accomplished with extrachromosomal substrates (transient expression) or artificial chromosomes 

(stable expression). Cells may also be cultured ex vivo in the presence of proteins of the present 

invention in order to proliferate or to produce a desired effect on or activity in such cells. 

Treated cells can then be introduced in vivo for therapeutic purposes. Alternatively, it is 

contemplated that in other human disease states, preventing the expression of or inhibiting the 
10 activity of polypeptides of the invention will be useful in treating the disease states. It is 

contemplated that antisense therapy or gene therapy could be applied to negatively regulate the 

expression of polypeptides of the invention. 

Other methods inhibiting expression of a protein include the introduction of antisense 

molecules to the nucleic acids of the present invention, their complements, or their translated RNA 
1 5 sequences, by methods known in the art. Further, the polypeptides of the present invention can be 

inhibited by using targeted deletion methods, or the insertion of a negative regulatory element such 

as a silencer, which is tissue specific. 

The present invention still further provides cells genetically engineered in vivo to express the 

polynucleotides of the invention, wherein such polynucleotides are in operative association with a 
20 regulatory sequence heterologous to the host cell, which drives expression of the polynucleotides in 

the cell. These methods can be used to increase or decrease the expression of the polynucleotides of 

the present invention. 

Knowledge of DNA sequences provided by the invention allows for modification of cells to 
permit, increase, or decrease, expression of endogenous polypeptide. Cells can be modified (e.g., by 

25 homologous recombination) to provide increased polypeptide expression by replacing, in whole or 
in part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells 
express the protein at higher levels. The heterologous promoter is inserted in such a manner that it is 
operatively linked to the desired protein encoding sequences. See, for example, PCT International 
PublicationNo. WO 94/12650,PCT International PublicationNo. WO 92/20808, and PCT 

30 International Publication No. WO 91/09955. It is also contemplated that, in addition to heterologous 
promoter DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which 
encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the desired 
protein coding sequence, amplification of the marker DNA by standard selection methods results in 

35 co-amplification of the desired protein coding sequences in the cells. 
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In another embodiment of the present invention, cells and tissues may be engineered to 

express an endogenous gene comprising the polynucleotides of the invention under the control of 

inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may 

be replaced by homologous recombination. As described herein, gene targeting can be used to 

5 replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene 

or a novel regulatory sequence synthesized by genetic engineering methods. Such regulatory 

sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative 

regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations 

of said sequences. Alternatively, sequences that affect the structure or stability of the RN A or 

1 0 protein produced may be replaced, removed, added, or otherwise modified by targeting. These 

sequences include polyadenylation signals, mRNA stability elements, splice sites, leader sequences 
for enhancing or modifying transport or secretion properties of the protein, or other sequences that 
alter or improve the function or stability of protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the gene 

1 5 under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both 
upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory 
element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the 
targeting event may replace an existing element; for example, a tissue-specific enhancer can be 
replaced by an enhancer that has broader or different cell-type specificity than the naturally 

20 occurring elements. Here, the naturally occurring sequences are deleted and new sequences are 
added. In all cases, the identification of the targeting event may be facilitated by the use of one or 
more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection 
of cells in which the exogenous DNA has integrated into the cell genome. The identification of the 
targeting event may also be facilitated by the use of one or more marker genes exhibiting the 

25 property of negative selection, such that the negatively selectable marker is linked to the exogenous 
DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and 
such that a correct homologous recombination event with sequences in the host cell genome does 
not result in the stable integration of the negatively selectable marker. Markers useful for this 
purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial 

3 0 xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques that can be used in accordance with this 
aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to Chappel; 
U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. PCT/US92/09627 
(WO93/09222) by Selden et al.; and International Application No. PCT/US90/06436 

35 (WO91/06667)by Skoultchi et al., each of which is incorporated by reference herein in its entirety. 
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3.6 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed or 
5 inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination are 
referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be 

10 prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. 

Transgenic animals . are useful to determine the roles polypeptides of the invention play in 
biological processes, and preferably in disease states. Transgenic animals are useful as model 
systems to identify compounds that modulate lipid metabolism. Transgenic animals, preferably 
non-human mammals, are produced using methods as described in U.S. Patent No 5,489,743 and 

15 PCT Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of a promoter of the 
polynucleotides of the invention is either activated or inactivated to alter the level of expression 
of the polypeptides of the invention. Inactivation can be carried out using homologous 
recombination methods described above. Activation can be achieved by supplementing or even 

20 replacing the homologous promoter to provide for increased protein expression. The 

homologous promoter can be supplemented by insertion of one or more heterologous enhancer 
elements known to confer promoter activation in a particular tissue. 

The polynucleotides of the present invention also make possible the development, 
through, e.g., homologous recombination or knock out strategies, of animals that fail to express 

25 polypeptides of the invention or that express a variant polypeptide. Such animals are useful as 
models for studying the in vivo activities of polypeptide as well as for studying modulators of the 
polypeptides of the invention. 

3.7 USES AND BIOLOGICAL ACTIVITY 

30 The polynucleotides and proteins of the present invention are expected to exhibit one or 

more of the uses or biological activities (including those associated with assays cited herein) 
identified herein. Uses or activities described for proteins of the present invention may be 
provided by administration or use of such proteins or of polynucleotides encoding such proteins 
(such as, for example, in gene therapies or vectors suitable for introduction of DNA). The 

35 mechanism underlying the particular condition or pathology will dictate whether the 
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polypeptides of the invention, the polynucleotides of the invention or modulators (activators or 

inhibitors) thereof would be beneficial to the subject in need of treatment Thus, "therapeutic 

compositions of the invention" include compositions comprising isolated polynucleotides 

(including recombinant DNA molecules, cloned genes and degenerate variants thereof) or 

5 polypeptides of the invention (including full length protein, mature protein and truncations or 

domains thereof), or compounds and other substances that modulate the overall activity of the 

target gene products, either at the level of target gene/protein expression or target protein 

activity. Such modulators include polypeptides, analogs, (variants), including fragments and 

fusion proteins, antibodies and other binding proteins; chemical compounds that directly or 

1 0 indirectly activate or inhibit the polypeptides of the invention (identified, e.g., via drug screening 
assays as described herein); antisense polynucleotides and polynucleotides suitable for triple 
helix formation; and in particular antibodies or other binding partners that specifically recognize 
one or more epitopes of the polypeptides of the invention. 

The polypeptides of the present invention may likewise be involved in cellular activation 

15 or in one of the other physiological pathways described herein. 

3.7.1 RESEARCH USES AND UTILITIES 

The research community can use the polynucleotides provided by the present invention 
for various purposes. The polynucleotides can be used to express recombinant protein for 

20 analysis, characterization or therapeutic use; as markers for tissues in which the corresponding 
protein is preferentially expressed (either constitutively or at a particular stage of tissue 
differentiation or development or in disease states); as molecular weight markers on gels; as 
chromosome markers or tags (when labeled) to identify chromosomes or to map related gene 
positions; to compare with endogenous DNA sequences in patients to identify potential genetic 

25 disorders; as probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known 
sequences in the process of discovering other novel polynucleotides; for selecting and making 
oligomers for attachment to a "gene chip" or other support, including for examination of 
expression patterns; to raise anti-protein antibodies using DNA immunization techniques; and as 

30 an antigen to raise anti-DNA antibodies or elicit another immune response. Where the 

polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for 
example, in a receptor-ligand interaction), the polynucleotide can also be used in interaction trap 
assays (such as, for example, that described in Gyuris et al., Cell 75:791-803 (1993)) to identify 
polynucleotides encoding the other protein with which binding occurs or to identify inhibitors of 

35 the binding interaction. 
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The polypeptides provided by the present invention can similarly be used in assays to 

determine biological activity, including in a panel of multiple proteins for high-throughput 

screening; to raise antibodies or to elicit another immune response; as a reagent (including the 

labeled reagent) in assays designed to quantitatively determine levels of the protein . (or its 

5 receptor) in biological fluids; as markers for tissues in which the corresponding polypeptide is 

preferentially expressed (either constitutively or at a particular stage of tissue differentiation or 

development or in a disease state); and, of course, to isolate correlative receptors or ligands. 

Proteins involved in these binding interactions can also be used to screen for peptide or small 

molecule inhibitors or agonists of the binding interaction. 

10 Any or all of these research utilities are capable of being developed into reagent grade or 

kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in the art. 
References disclosing such methods include without limitation "Molecular Cloning: A 
Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch 

15 and T. Maniatis eds., 1989, and "Methods in Enzymplogy: Guide to Molecular Cloning 
Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987. 

3.7.2 NUTRITIONAL USES 

Polynucleotides and polypeptides of the present invention can also be used as nutritional 
20 sources or supplements. Such uses include without limitation use as a protein or amino acid 

supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In 
such cases the polypeptide or polynucleotide of the invention can be added to the feed of a 
particular organism or can be administered as a separate solid or liquid preparation, such as in the 
form of powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the 
25 polypeptide or polynucleotide of the invention can be added to the medium in or on which the 
microorganism is cultured. 

3.73 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 
ACTIVITY 

30 A polypeptide of the present invention may exhibit activity relating to cytokine, cell 

proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting) 
activity or may induce production of other cytokines in certain cell populations. A 
polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Many 
protein factors discovered to date, including all known cytokines, have exhibited activity in one 

35 or more factor-dependent cell proliferation assays, and hence the assays serve as a convenient 
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confirmation of cytokine activity. The activity of therapeutic compositions of the present 

invention is evidenced by any one of a number of routine factor dependent cell proliferation 

assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/1 1, BaF3, 

MC9/G, M+(preB M+), 2E8, RB5, DAI, 123, Tl 165, HT2, CTLL2, TF-1, Mo7e, CMK, 

5 HUVEC, and Caco. Therapeutic compositions of the invention can be used in the following: 

Assays for T-cell or thymocyte proliferation include without limitation those described 

in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 

M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 

In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 

10 Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol. 

145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Bertagnolli, 
et al., I. Immunol. 149:3778-3783, 1992; Bowman et al., I. Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or 
thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, 

1 5 Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan 
eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of mouse 
and human interleukin-y, Schreiber, R. D. In Current Protocols in Immunology. J. E. e.a. Coligan 
eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells 

20 include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 
and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in 
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991; 
deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., Nature 336:690-692, 1988; 
Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse 

25 and human interleukin 6-Nordan, R. In Current Protocols in Immunology. J. E. Coligan eds. Vol 
1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al., Proc. Natl. Aced. Sci. 
U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin 11-Bennett, F., Giannotti, J., 
Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 
6.15.1 John Wiley and Sons, Toronto. 1991; Measurement of mouse and human Interleukin 

30 9-Ciarletta, A., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. 
J. E. Coligan eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991. 

Assays for T-cell clone responses to antigens (which will identify, among others, proteins 
that affect APC-T cell interactions as well as direct T-cell effects by measuring proliferation and 
cytokine production) include, without limitation, those described in: Current Protocols in 

35 Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W Strober, 
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Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse 

Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7, 

Immunologic studies in Humans); Weinberger et ah, Proc. Natl. Acad. Sci USA 77:6091-6095, 

1980; Weinberger etal., Eur. J. Immun. 11:405-411, 1981; Takaietal., J. Immunol. 

5 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988. 

3.7.4 STEM CELL GROWTH FACTOR ACTIVITY 

A polypeptide of the present invention may exhibit stem cell growth factor activity and 
be involved in the proliferation, differentiation and survival of pluripotent and totipotent stem 

1 0 cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells and/or 
germ line stem cells. Administration of the polypeptide of the invention to stem cells in vivo or 
ex vivo is expected to maintain and expand cell populations in a totipotential or pluripotential 
state which would be useful for re-engineering damaged or diseased tissues, transplantation, 
manufacture of bio-pharmaceuticals and the development of bio-sensors. The ability to produce 

1 5 large quantities of human cells has important working applications for the production of human 
proteins which currently must be obtained from non-human sources or donors, implantation of 
cells to treat diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases; 
tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including 
cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs 

20 for transplantation such as kidney, liver, pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or cytokines may 
be administered in combination with the polypeptide of the invention to achieve the desired 
effect, including any of the growth factors listed herein, other stem cell maintenance factors, and 
specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), Flt-3 ligand (Fit- 

25 3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, bone marrow 

inflammatory protein 1 -alpha (MIP-1 -alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet 
factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors and basic 
fibroblast growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, expansion of 

30 these cells in culture will facilitate the production of large quantities of mature cells. Techniques 
for culturing stem cells are known in the art and administration of polypeptides of the invention, 
optionally with other growth factors and/or cytokines, is expected to enhance the survival and 
proliferation of the stem cell populations. This can be accomplished by direct administration of 
the polypeptide of the invention to the culture medium. Alternatively, stroma cells transfected 

35 with a polynucleotide that encodes for the polypeptide of the invention can be used as a feeder 
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layer for the stem cell populations in culture or in vivo. Stromal support cells for feeder layers 

may include embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or 

cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926). 

Stem cells themselves can be transfected with a polynucleotide of the invention to induce 
autocrine expression of the polypeptide of the invention. This will allow for generation of 
undifferentiated totipotential/pluripotential stem cell lines that are usefiil as is or that can then be 
differentiated into the desired mature cell types. These stable cell lines can also serve as a source 
of undifferentiated totipotential/pluripotential mRNA to create cDNA libraries and templates for 
polymerase chain reaction experiments. These studies would allow for the isolation and 
identification of differentially expressed genes in stem cell populations that regulate stem cell 
proliferation and/or maintenance. 

Expansion and maintenance of totipotent stem cell populations will be useful in the 
treatment of many pathological conditions. For example, polypeptides of the present invention 
may be used to manipulate stem cells in culture to give rise to neuroepithelial cells that can be 
used to augment or replace cells damaged by illness, autoimmune disease, accidental damage or 
genetic disorders. The polypeptide of the invention may be useful for inducing the proliferation 
of neural cells and for the regeneration of nerve and brain tissue, i.e. for the treatment of central 
and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic 
disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In 
addition, the expanded stem cell populations can also be genetically altered for gene therapy 
purposes and to decrease host rejection of replacement tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can also be 
manipulated to achieve controlled differentiation of the stem cells into more differentiated cell 
types. A broadly applicable method of obtaining pure populations of a specific differentiated 
cell type from undifferentiated stem cell populations involves the use of a cell-type specific 
promoter driving a selectable marker. The selectable marker allows only cells of the desired type 
to survive. For example, stem cells can be induced to differentiate into cardiomyocytes (Wobus 
et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998)) 
or skeletal muscle cells (Browder, L. W. In: Principles of Tissue Engineering eds. Lanza et al., 
Academic Press (1997)). Alternatively, directed differentiation of stem cells can be 
accomplished by culturing the stem cells in the presence of a differentiation factor such as 
retinoic acid and an antagonist of the polypeptide of the invention which would inhibit the 
effects of endogenous stem cell factor activity and allow differentiation to proceed. 

In vitro cultures of stem cells can be used to determine if the polypeptide of the invention 
exhibits stem cell growth factor activity. Stem cells are isolated from any one of various cell 
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sources (including hematopoietic stem cells and embryonic stem cells) and cultured on a feeder 

layer, as described by Thompson et ah Proc. Natl. Acad. Sci, U.S.A., 92: 7844-7848 (1995), in 

the presence of the polypeptide of the invention alone or in combination with other growth 

factors or cytokines. The ability of the polypeptide of the invention to induce stem cells 

5 proliferation is determined by colony formation on semi-solid support e.g. as described by 

Bernstein et al., Blood, 77: 2316-2321 (1991). 

3.7.5 HEMATOPOIESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of hematopoiesis 

10 and, consequently, in the treatment of myeloid or lymphoid cell disorders. Even marginal 

biological activity in support of colony forming cells or of factor-dependent cell lines indicates 
involvement in regulating hematopoiesis, e.g. in supporting the growth and proliferation of 
erythroid progenitor cells alone or in combination with other cytokines, thereby indicating 
utility, for example, in treating various anemias or for use in conjunction with 

1 5 irradiation/chemotherapy to stimulate the production of erythroid precursors and/or erythroid 
cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and 
monocytes/bone marrows (i.e., traditional CSF activity) useful, for example, in conjunction with 
chemotherapy to prevent or treat consequent myelo-suppression; in supporting the growth and 
proliferation of megakaryocytes and consequently of platelets thereby allowing prevention or 

20 treatment of various platelet disorders such as thrombocytopenia, and generally for use in place 
of or complimentary to platelet transfusions; and/or in supporting the growth and proliferation of 
hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned 
hematopoietic cells and therefore find therapeutic utility in various stem cell disorders (such as 
those usually treated with transplantation, including, without limitation, aplastic anemia and 

25 paroxysmal nocturnal hemoglobinuria), as well as in repopulating the stem cell compartment 
post irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow 
transplantation or with peripheral progenitor cell transplantation (homologous or heterologous)) 
as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 

30 Suitable assays for proliferation and differentiation of various hematopoietic lines are 

cited above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 
proteins that influence embryonic differentiation hematopoiesis) include, without limitation, 
those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., Molecular 
35 and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 81:2903-2915, 1993. 
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Assays for stem cell survival and differentiation (which will identify, among others, 
proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: 
Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I. 
Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; Hirayama et al., 
5 Proc. Natl. Acad. Sci. USA 89:5907-591 1, 1 992; Primitive hematopoietic colony forming cells 
with high proliferative potential, McNiece, I. K. and Briddell, R. A. In Culture of Hematopoietic 
Cells. R. L Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et 
al., Experimental Hematology 22:353-359, 1994; Cobblestone area forming cell assay, 
Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 1-21, 
1 0 Wiley-Liss, Inc., New York, N.Y. 1 994; Long term bone marrow cultures in the presence of 
stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. I. 
Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, N.Y. 1994; Long term culture 
initiating cell assay, Sutherland, H. J. In Culture of Hematopoietic Cells. R. I. Freshney, et al. 
eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, N.Y. 1994. 

15 

3.7.6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, tendon, 
ligament and/or nerve tissue growth or regeneration, as well as in wound healing and tissue 
repair and replacement, and in healing of burns, incisions and ulcers. 

20 A polypeptide of the present invention that induces cartilage and/or bone growth in 

circumstances where bone is not normally formed has application in the healing of bone fractures 
and cartilage damage or defects in humans and other animals. Compositions of a polypeptide, 
antibody, binding partner, or other modulator of the invention may have prophylactic use in 
closed as well as open fracture reduction and also in the improved fixation of artificial joints. De 

25 novo bone formation induced by an osteogenic agent contributes to the repair of congenital, 
trauma induced, or oncologic resection induced craniofacial defects, and also is useful in 
cosmetic plastic surgery. 

A polypeptide of this invention may also be involved in attracting bone-forming cells, 
stimulating growth of bone-forming cells, or inducing differentiation of progenitors of 

30 bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 
periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking 
inflammation or processes of tissue destruction (collagenase activity, osteoclast activity, etc.) 
mediated by inflammatory processes may also be possible using the composition of the 
invention. 
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Another category of tissue regeneration activity that may involve the polypeptide of the 

present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue or other 

tissue formation in circumstances where such tissue is not normally formed, has application in 

the healing of tendon or ligament tears, defoimities and other tendon or ligament defects in 

5 humans and other animals. Such a preparation employing a tendon/ligament-like tissue inducing 

protein may have prophylactic use in preventing damage to tendon or ligament tissue, as well as 

use in the improved fixation of tendon or ligament to bone or other tissues, and in repairing 

defects to tendon or ligament tissue. De novo tendon/ligament-like tissue formation induced by 

a composition of the present invention contributes to the repair of congenital, trauma induced, or 

10 other tendon or ligament defects of other origin, and is also useful in cosmetic plastic surgery for 
attachment or repair of tendons or ligaments. The compositions of the present invention may 
provide environment to attract tendon- or ligament-forming cells, stimulate growth of tendon- or 
ligament-forming cells, induce differentiation of progenitors of tendon- or ligament-forming 
cells, or induce growth of tendon/ligament cells or progenitors ex vivo for return in vivo to effect 

15 tissue repair. The compositions of the invention may also be useful in the treatment of 

tendonitis, carpal tunnel syndrome and other tendon or ligament defects. The compositions may 
also include an appropriate matrix and/or sequestering agent as a carrier as is well known in the 
art. 

The compositions of the present invention may also be useful for proliferation of neural 

20 cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral 
nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which 
involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a 
composition may be used in the treatment of diseases of the peripheral nervous system, such as 
peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous 

25 system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 
lateral sclerosis, and Shy-Drager syndrome. Further, conditions that may be treated in 
accordance with the present invention include mechanical and traumatic disorders, such as spinal 
cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies 
resulting from chemotherapy or other medical therapies may also be treatable using a 

30 composition of the invention. 

Compositions of the invention may also be useful to promote better or faster closure of 
non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular 
insufficiency, surgical and traumatic wounds, and the like. 

Compositions of the present invention may also be involved in the generation or 

35 regeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine, 
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kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular 

endothelium) tissue, or for promoting the growth of cells comprising such tissues. Part of the 

desired effects may be by inhibition or modulation of fibrotic scarring may allow normal tissue 

to regenerate. A polypeptide of the present invention may also exhibit angiogenic activity. 

5 A composition of the present invention may also be useful for gut protection or 

regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and 

conditions resulting from systemic cytokine damage. 

A composition of the present invention may also be useful for promoting or inhibiting 
differentiation of tissues described above from precursor tissues or cells; or for inhibiting the 
10 growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 

Assays for tissue generation activity include, without limitation, those described in: 
International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International Patent 
Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. 
1 5 WO91/07491 (skin, endothelium). 

Assays for wound healing activity include, without limitation, those described in: Winter, 
Epidermal Wound Healing, pps. 71-1 12 (Maibach, H. I. and Rovee, D. T., eds.), Year Book 
Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol 
71:382-84 (1978). 

20 

3-7.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or immune 
suppressing activity, including without limitation the activities for which assays are described 
herein. A polynucleotide of the invention can encode a polypeptide exhibiting such activities. A 

25 protein may be useful in the treatment of various immune deficiencies and disorders (including 
severe combined immunodeficiency (SCID)), e.g., in regulating (up or down) growth and 
proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells 
and other cell populations. These immune deficiencies may be genetic or be caused by viral (e.g., 
HIV) as well as bacterial or fungal infections, or may result from autoimmune disorders. More 

30 specifically, infectious diseases causes by viral, bacterial, fungal or other infection may be 

treatable using a protein of the present invention, including infections by HIV, hepatitis viruses, 
herpes viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such 
as candidiasis. Of course, in this regard, proteins of the present invention may also be useful 
where a boost to the immune system generally may be desirable, i.e., in the treatment of cancer. 
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Autoimmune disorders that may be treated using a protein of the present invention 
include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus, 
rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome, 
autoimmune thyroiditis, insulin dependent diabetes mellitus, myasthenia gravis, graft-versus-host 
5 disease and autoimmune inflammatory eye disease. Such a protein (or antagonists thereof, 
including antibodies) of the present invention may also to be useful in the treatment of allergic 
reactions and conditions (e.g., anaphylaxis, serum sickness, drug reactions, food allergies, insect 
venom allergies, mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria, 
angioedema, eczema, atopic dermatitis, allergic contact dermatitis, erythema multiforme, 

10 Stevens- Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, venereal 
keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma 
(particularly allergic asthma) or other respiratory problems. Other conditions, in which immune 
suppression is desired (including, for example, organ transplantation), may also be treatable 
using a protein (or antagonists thereof) of the present invention. The therapeutic effects of the 

15 polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo animals 

models such as the cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66, 
1998), skin prick test (Hoffinann et al., Allergy 54: 446-54, 1999), guinea pig skin sensitization 
test (Vohr et al., Arch. Toxocol. 73: 501-9), and murine local lymph node assay (Kimber et al., J. 
Toxicol. Environ. Health 53: 563-79). 

20 Using the proteins of the invention it may also be possible to modulate immune 

responses, in a number of ways. Down regulation may be in the form of inhibiting or blocking 
an immune response already in progress or may involve preventing the induction of an immune 
response. The functions of activated T cells may be inhibited by suppressing T cell responses or 
by inducing specific tolerance in T cells, or both. Immunosuppression of T cell responses is 

25 generally an active, non-antigen-specific, process that requires continuous exposure of the T cells 
to the suppressive agent. Tolerance, which involves inducing non-responsiveness or anergy in T 
cells, is distinguishable from immunosuppression in that it is generally antigen-specific and 
persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be 
demonstrated by the lack of a T cell response upon reexposure to specific antigen in the absence 

30 of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 
limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high 
level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and 
organ transplantation and in graft-versus-host disease (GVHD). For example, blockage of T cell 

35 function should result in reduced tissue destruction in tissue transplantation. Typically, in tissue 
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transplants, rejection of the transplant is initiated through its recognition as foreign by T cells, 

followed by an immune reaction that destroys the transplant. The administration of a therapeutic 

composition of the invention may prevent cytokine synthesis by immune cells, such as T cells, 

and thus acts as an immunosuppressant. Moreover, a lack of costimulation may also be 

5 sufficient to anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term 

tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated 

administration of these blocking reagents. To achieve sufficient immunosuppression or 

tolerance in a subject, it may also be necessary to block the function of a combination of B 

lymphocyte antigens. 

1 0 The efficacy of particular therapeutic compositions in preventing organ transplant 

rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac grafts in 
rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine 
the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et 

15 al., Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. Sci USA, 89:1 1 102-1 1 105 
(1992). In addition, murine models of GVHD (see Paul ed., Fundamental Immunology, Raven 
Press, New York, 1989, pp. 846-847) can be used to determine the effect of therapeutic 
compositions of the invention on the development of that disease. 

Blocking antigen function may also be therapeutically useful for treating autoimmune 

20 diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that 
are reactive against self-tissue and which promote the production of cytokines and 
autoantibodies involved in the pathology of the diseases. Preventing the activation of auto- 
reactive T cells may reduce or eliminate disease symptoms. Administration of reagents which 
block stimulation of T cells can be used to inhibit T cell activation and prevent production of 

25 autoantibodies or T cell-derived cytokines which may be involved in the disease process. 

Additionally, blocking reagents may induce antigen-specific tolerance of auto-reactive T cells 
which could lead to long-term relief from the disease. The efficacy of blocking reagents in 
preventing or alleviating autoimmune disorders can be determined using a number of 
well-characterized animal models of human autoimmune diseases. Examples include murine 

30 experimental autoimmune encephalitis, systemic lupus erythmatosis in MRL/lpr/lpr mice or 

NZB hybrid mice, murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB 
rats, and murine experimental myasthenia gravis (see Paul ed., Fundamental Immunology, Raven 
Press, New York, 1 989, pp. 840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a means 

35 of up regulating immune responses, may also be useful in therapy. Upregulation of immune 
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responses may be in the form of enhancing an existing immune response or eliciting an initial 

immune response. For example, enhancing an immune response may be useful in cases of viral 

infection, including systemic viral diseases such as influenza, the common cold, and 

encephalitis. 

5 Alternatively, anti-viral immune responses may be enhanced in an infected patient by 

removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed 
APCs either expressing a peptide of the present invention or together with a stimulatory form of 
a soluble peptide of the present invention and reintroducing the in vitro activated T cells into the 
patient. Another method of enhancing anti-viral immune responses would be to isolate infected 

10 cells from a patient, transfect them with a nucleic acid encoding a protein of the present 

invention as described herein such that the cells express all or a portion of the protein on their 
surface, and reintroduce the transfected cells into the patient. The infected cells would now be 
capable of delivering a costimulatory signal to, and thereby activate, T cells in vivo. 

A polypeptide of the present invention may provide the necessary stimulation signal to T 

1 5 cells to induce a T cell mediated immune response against the transfected tumor cells. In 
addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to 
reexpress sufficient mounts of MHC class I or MHC class II molecules, can be transfected with 
nucleic acid encoding all or a portion of (e.g., a cytoplasmic-domain truncated portion) of an 
MHC class I alpha chain protein and p 2 microglobulin protein or an MHC class II alpha chain 

20 protein and an MHC class II beta chain protein to thereby express MHC class I or MHC class II 
proteins on the cell surface. Expression of the appropriate class I or class II MHC in conjunction 
with a peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T 
cell mediated immune response against the transfected tumor cell. Optionally, a gene encoding 
an antisense construct which blocks expression of an MHC class II associated protein, such as 

25 the invariant chain, can also be cotransfected with a DNA encoding a peptide having the activity 
of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce 
tumor specific immunity. Thus, the induction of a T cell mediated immune response in a human 
subject may be sufficient to overcome tumor-specific tolerance in the subject. 

The activity of a protein of the invention may, among other means, be measured by the 

30 following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. 
H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 

35 Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 
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78:2488-2492, 1981; Heirmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. 

Immunol. 135:1564-1572, 1985; Takai et al., I. Immunol. 137:3494-3500, 1986; Takai et al., J. 

Immunol. 140:508-512, 1988; Bowman et al., J. Virology 61:1992-1998; Bertagnolli et al., 

Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 1994. 

5 Assays for T-cell-dependent immunoglobulin responses and isotype switching (which 

will identify, among others, proteins that modulate T-cell dependent antibody responses and that 

affect Thl/Th2 profiles) include, without limitation, those described in: Maliszewski, J. 

Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, 

Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 

10 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, proteins 
that generate predominantly Thl and CTL responses) include, without limitation, those described 
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 

1 5 In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 
1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins expressed 
by dendritic cells that activate naive T-cells) include, without limitation, those described in: 

20 Guery et al., J. Immunol. 1 34:536-544, 1 995; Inaba et al., Journal of Experimental Medicine 

173:549-559, 1991; Macatoniaet al., Journal oflmmunology 154:5071-5079, 1995; Porgadoret 
al., Journal of Experimental Medicine 1 82:255-260, 1995; Nair et al., Journal of Virology 
67:4062-4069, 1993; Huang et al., Science 264:961-965, 1994; Macatonia et al., Journal of 
Experimental Medicine 169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical Investigation 

25 94:797-807, 1994; and Inaba et al., Journal of Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins 
that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte 
homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry 
13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Research 

30 53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, Journal oflmmunology 
145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al., International 
Journal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and development 
include, without limitation, those described in: Antica et al., Blood 84:1 1 1-1 17, 1994; Fine et 



WO 01/74836 PCT/US01/10472 
al., Cellular Immunology 155:1 1 1-122, 1994; Galy et al., Blood 85:2770-2778, 1995; Toki et al., 

Proc. Nat. Acad. Sci. USA 88:7548-7551, 1991. 

3.7.8 ACTIVIN/INHIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related 
5 activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to stimulate the 
release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present invention, 
alone or in heterodimers with a member of the inhibin family, may be useful as a contraceptive 
1 0 based on the ability of inhibins to decrease fertility in female mammals and decrease 

spermatogenesis in male mammals. Administration of sufficient amounts of other inhibins can 
induce infertility in these mammals. Alternatively, the polypeptide of the invention, as a 
homodimer or as a heterodimer with other protein subunits of the inhibin group, may be useful as 
a fertility inducing therapeutic, based upon the ability of activin molecules in stimulating FSH 
15 release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A 
polypeptide of the invention may also be useful for advancement of the onset of fertility in 
sexually immature mammals, so as to increase the lifetime reproductive performance of domestic 
animals such as, but not limited to, cows, sheep and pigs. 

The activity of a polypeptide of the invention may, among other means, be measured by 
20 the following methods. 

Assays for activin/inhibin activity include, without limitation, those described in: Vale et 
al., Endocrinology 91 :562-572, 1 972; Ling et al., Nature 321 :779-782, 1986; Vale et al., Nature 
321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. Natl. Acad. Sci. 
USA 83:3091-3095, 1986. 

25 

3.7.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY 

A polypeptide of the present invention may be involved in chemotactic or chemokinetic 
activity for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils, 
T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A polynucleotide of the 
30 invention can encode a polypeptide exhibiting such attributes. Chemotactic and chemokinetic 
receptor activation can be used to mobilize or attract a desired cell population to a desired site of 
action. Chemotactic or chemokinetic compositions (e.g. proteins, antibodies, binding partners, 
or modulators of the invention) provide particular advantages in treatment of wounds and other 
trauma to tissues, as well as in treatment of localized infections. For example, attraction of 
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lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in improved 
immune responses against the tumor or infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it can 
stimulate, directly or indirectly, the directed orientation or movement of such cell population. 
5 Preferably, the protein or peptide has the ability to directly stimulate directed movement of cells. 
Whether a particular protein has chemotactic activity for a population of cells can be readily 
determined by employing such protein or peptide in any known assay for cell chemotaxis. 
Therapeutic compositions of the invention can be used in the following: 
Assays for chemotactic activity (which will identify proteins that induce or prevent 
1 0 chemotaxis) consist of assays that measure the ability of a protein to induce the migration of 
cells across a membrane as well as the ability of a protein to induce the adhesion of one cell 
population to another cell population. Suitable assays for movement and adhesion include, 
without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. 
M. Kruisbeek, D. H. Marguiles, E. M Shevach, W. Strober, Pub. Greene Publishing Associates 
1 5 and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta Chemokines 

6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. APMIS 103:140-146, 
1995; Mulleret al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860-5867, 
1994; Johnston et al. J. of Immunol. 153:1762-1768, 1994. 

20 3-7.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostatis or thrombolysis or 
thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such 
attributes. Compositions may be useful in treatment of various coagulation disorders (including 
hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events 

25 in treating wounds resulting from trauma, surgery or other causes. A composition of the 
invention may also be useful for dissolving or inhibiting formation of thromboses and for 
treatment and prevention of conditions resulting therefrom (such as, for example, infarction of 
cardiac and central nervous system vessels (e.g., stroke)). 

Therapeutic compositions of the invention can be used in the following: 

30 Assay for hemostatic and thrombolytic activity include, without limitation, those 

described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis Res. 
45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 
35:467-474,1988. 

35 3.7.11 CANCER DIAGNOSIS AND THERAPY 
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Polypeptides of the invention may be involved in cancer cell generation, proliferation or 
metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the 
invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. For 
example, the presence or increased expression of a polynucleotide/polypeptide of the invention 
5 may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing malignancy. 
. Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancer 
condition. Identification of single nucleotide polymorphisms associated with cancer or a 
predisposition to cancer may also be useful for diagnosis or prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 

10 inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor growth) 
and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. Therapeutic 
compositions of the invention may be effective in adult and pediatric oncology including in solid 
phase tumors/malignancies, locally advanced tumors, human soft tissue sarcomas, metastatic 
cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma, 

1 5 acute and chronic leukemias, and lymphomas, head and neck cancers including mouth cancer, 
larynx cancer and thyroid cancer, lung cancers including small cell carcinoma and non-small cell 
cancers, breast cancers including small cell carcinoma and ductal carcinoma, gastrointestinal 
cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps 
associated with colorectal neoplasia, pancreatic cancers, liver cancer, urologic cancers including 

20 bladder cancer and prostate cancer, malignancies of the female genital tract including ovarian 
carcinoma, uterine (including endometrial) cancers, and solid tumor in the ovarian follicle, 
kidney cancers including renal cell carcinoma, brain cancers including intrinsic brain tumors, 
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central 
nervous system, bone cancers including osteomas, skin cancers including malignant melanoma, 

25 tumor progression of human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, 
hemangiopericytoma and Karposi's sarcoma. 

Polypeptides, polynucleotides, or modulators of polypeptides of the invention (including 
inhibitors and stimulators of the biological activity of the polypeptide of the invention) may be 
administered to treat cancer. Therapeutic compositions can be administered in therapeutically 

30 effective dosages alone or in combination with adjuvant cancer therapy such as surgery, 

chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial 
effect, e.g. reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or otherwise 
improving overall clinical condition, without necessarily eradicating the cancer. 

The composition can also.be administered in therapeutically effective amounts as a 

35 portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or 
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modulator of the invention with one or more anti-cancer drugs in addition to a pharmaceutical^ 
acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer treatment is routine. 
Anti-cancer drugs that are well known in the art and can be used as a treatment in combination 
with the polypeptide or modulator of the invention include: Actinomycin D, Aminoglutethimide, 
Asparaginase, Bleomycin, Busulfan, Carboplatin, Carmustine, Chlorambucil, Cisplatin (cis- 
DDP), Cyclophosphamide, Cytarabine HC1 (Cytosine arabinoside), Dacarbazine, Dactinomycin, 
Daunorubicin HC1, Doxorubicin HC1, Estramustine phosphate sodium, Etoposide (VI 6-2 13), 
Floxuridine, 5-Fluorouracil (5-Fu), Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, 
Interferon Alpha-2a, Interferon Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), 
Lomustine, Mechlorethamine HC1 (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, 
Methotrexate (MTX), Mitomycin, Mitoxantrone HC1, Octreotide, Plicamycin, Procarbazine HC1, 
Streptozocin, Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate, 
Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin, 
Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for prophylactic 
treatment of cancer. There are hereditary conditions and/or environmental situations (e.g. 
exposure to carcinogens) known in the art that predispose an individual to developing cancers. 
Under these circumstances, it may be beneficial to treat these individuals with therapeutically 
effective doses of the polypeptide of the invention to reduce the risk of developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of the 
invention as a potential cancer treatment. These in vitro models include proliferation assays of 
cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1987) Culture of 
Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, NY Ch 18 and Ch 21), 
tumor systems in nude mice as described in Giovanella et ah, J. Natl. Can. Inst., 52: 921-30 
(1974), mobility and invasive potential of tumor cells in Boyden Chamber assays as described in 
Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays such as induction 
of vascularization of the chick chorioallantoic membrane or induction of vascular endothelial 
cell migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1 1 89-97 (1999) and Li et a]., 
Clin. Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor cells lines are available, 
e.g. from American Type Tissue Culture Collection catalogs. 

3.7.12 RECEPTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, 
receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of the 
invention can encode a polypeptide exhibiting such characteristics. Examples of such receptors 
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and ligands include, without limitation, cytokine receptors and their ligands, receptor kinases and 

their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions 

and their ligands (including without limitation, cellular adhesion molecules (such as selecting 

integrins and their ligands) and receptorAigand pairs involved in antigen presentation, antigen 

5 recognition and development of cellular and humoral immune responses. Receptors and ligands 

are also useful for screening of potential peptide or small molecule inhibitors of the relevant 

receptor/ligand interaction. A protein of the present invention (including, without limitation, 

fragments of receptors and ligands) may themselves be useful as inhibitors of receptor/ligand 

interactions. 

10 The activity of a polypeptide of the invention may, among other means, be measured by 

the following methods: 

Suitable assays for receptor-ligand activity include without limitation those described in: 

Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. 

Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- Interscience (Chapter 7.28, 
1 5 Measurement of Cellular Adhesion under static conditions 7.28 . 1 - 7.28.22), Takai et al., Proc. 

NatL Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., J. Exp. Med. 168:1 145-1 156, 1988; 

Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 

175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995. 

By way of example, the polypeptides of the invention may be used as a receptor for a 
20 ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be identified 

through binding assays, affinity chromatography, dihybrid screening assays, BIAcore assays, gel 

overlay assays, or other methods known in the art. 

Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or a 

partial antagonist require the use of other proteins as competing ligands. The polypeptides of the 
25 present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes, 

colorimetric molecules or toxin molecules by conventional methods. ("Guide to Protein 

Purification" Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 (1990) Academic 

Press, Inc. San Diego). Examples of radioisotopes include, but are not limited to, tritium and 

carbon- 14. Examples of colorimetric molecules include, but are not limited to, fluorescent 
30 molecules such as fluorescamine, or rhodamine or other colorimetric molecules. Examples of 

toxins include, but are not limited, to ricin. 
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3.7.13 DRUG SCREENING 

This invention is particularly useful for screening chemical compounds by using the 
novel polypeptides or binding fragments thereof in any of a variety of drug screening techniques. 
The polypeptides or fragments employed in such a test may either be free in solution, affixed to a 
5 solid support, borne on a cell surface or located intracellularly. One method of drug screening 
utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant 
nucleic acids expressing the polypeptide or a fragment thereof. Drugs are screened against such 
transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can 
be used for standard binding assays. One may measure, for example, the formation of 

1 0 complexes between polypeptides of the invention or fragments and the agent being tested or 
examine the diminution in complex formation between the novel polypeptides and an 
appropriate cell line, which are well known in the art. 

Sources for test compounds that may be screened for ability to bind to or modulate (i.e., 
increase or decrease) the activity of polypeptides of the invention include (1) inorganic and 

1 5 organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of either random or mimetic peptides, oligonucleotides or organic molecules. 

Chemical libraries may be readily synthesized or purchased from a number of 
commercial sources, and may include structural analogs of known compounds or compounds 
that are identified as "hits" or "leads" via natural product screening. 

20 The sources of natural product libraries are microorganisms (including bacteria and 

fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for 
screening may be created by: (1) fermentation and extraction of broths from soil, plant or marine 
microorganisms or (2) extraction of the organisms themselves. Natural product libraries include 
polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a 

25 review, see Science 252:63-68 (1 998). 

Combinatorial libraries are composed of large numbers of peptides, oligonucleotides or 
organic compounds and can be readily prepared by traditional automated synthesis methods, 
PCR, cloning or proprietary synthetic methods. Of particular interest are peptide and 
oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, protein, 

30 peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide libraries. 
For a review of combinatorial chemistry and libraries created therefrom, see Myers, Curr. Opin. 
Biotechnol 8:701-707 (1997). For reviews and examples of peptidomimetic libraries, see 
Al-Obeidi et ah, Mol Biotechnol, 9(3):205-23 (1998); Hruby et al., Curr Opin Chem Biol, 
1(1):1 14-19 (1997); Dorner et al., Bioorg Med Chem, 4(5):709-15 (1996) (alkylated dipeptides). 
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Identification of modulators through use of the various libraries described herein permits 
modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to bind a 
polypeptide of the invention. The molecules identified in the binding assay are then tested for 
antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the 
5 art. In brief, the molecules are titrated into a plurality of cell cultures or animals and then tested 
for either cell/animal death or prolonged survival of the animal/cells. 

The binding molecules thus identified may be complexed with toxins, e.g., ricin or 
cholera, or with other compounds that are toxic to cells such as radioisotopes. The toxin-binding 
molecule complex is then targeted to a tumor or other cell by the specificity of the binding 
10 molecule for a polypeptide of the invention. Alternatively, the binding molecules may be 
complexed with imaging agents for targeting and imaging purposes. 

3-7.14 ASSAY FOR RECEPTOR ACTIVITY 

The invention also provides methods to detect specific binding of a polypeptide e.g. a 

1 5 ligand or a receptor. The art provides numerous assays particularly useful for identifying 

previously unknown binding partners for receptor polypeptides of the invention. For example, 
expression cloning using mammalian or bacterial cells, or dihybrid screening assays can be used 
to identify polynucleotides encoding binding partners. As another example, affinity 
chromatography with the appropriate immobilized polypeptide of the invention can be used to 

20 isolate polypeptides that recognize and bind polypeptides of the invention. There are a number 
of different libraries used for the identification of compounds, and in particular small molecules, 
that modulate {i.e., increase or decrease) biological activity of a polypeptide of the invention. 
Ligands for receptor polypeptides of the invention can also be identified by adding exogenous 
ligands, or cocktails of ligands to two cells populations that are genetically identical except for 

25 the expression of the receptor of the invention: one cell population expresses the receptor of the 
invention whereas the other does not. The response of the two cell populations to the addition of 
ligands(s) is then compared. Alternatively, an expression library can be co-expressed with the 
polypeptide of the invention in cells and assayed for an autocrine response to identify potential 
ligand(s). As still another example, BIAcore assays, gel overlay assays, or other methods known 

30 in the art can be used to identify binding partner polypeptides, including, (1) organic and 
inorganic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of random peptides, oligonucleotides or organic molecules. 

The role of downstream intracellular signaling molecules in the signaling cascade of the 
* polypeptide of the invention can be determined. For example, a chimeric protein in which the 

35 cytoplasmic domain of the polypeptide of the invention is fused to the extracellular portion of a 
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protein, whose ligand has been identified, is produced in a host cell. The cell is then incubated 

with the ligand specific for the extracellular portion of the chimeric protein, thereby activating 

the chimeric receptor. Known downstream proteins involved in intracellular signaling can then 

be assayed for expected modifications i.e. phosphorylation. Other methods known to those in 

5 the art can also be used to identify signaling molecules involved in receptor activity. 

3.7.15 ANTI-INPLAMMATORY ACTIVITY 

Compositions of the present invention may also exhibit anti-inflammatory activity. The 
anti-inflammatory activity may be achieved by providing a stimulus to cells involved in the 

10 inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example, 
cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory 
process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production 
of other factors which more directly inhibit or promote an inflammatory response. Compositions 
with such activities can be used to treat inflammatory conditions including chronic or acute 

1 5 conditions), including without limitation intimation associated with infection (such as septic 

shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, 
endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting from 
over production of cytokines such as TNF or IL-1 . Compositions of the invention may also be 

20 useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 

Compositions of this invention may be utilized to prevent or treat conditions such as, but not 
limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid 
arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, 
graft versus host disease, inflammatory bowel disease, inflammation associated with pulmonary 

25 disease, other autoimmune disease or inflammatory disease, an antiproliferative agent such as for 
acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to 
intrauterine infections. 

3.7.16 LEUKEMIAS 

30 Leukemias and related disorders may be treated or prevented by administration of a 

therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the 
invention. Such leukemias and related disorders include but are not limited to acute leukemia, 
acute lymphocytic leukemia, acute myelocytic leukemia, myeloblasts, promyelocyte, 
myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic myelocytic 
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(granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see 
Fishman et al., 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 



3.7.17 NERVOUS SYSTEM DISORDERS 

5 Nervous system disorders, involving cell types which can be tested for efficacy of 

intervention with compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an indication of 
therapeutic utility, include but are not limited to nervous system injuries, and diseases or 
disorders which result in either a disconnection of axons, a diminution or degeneration of 
1 0 neurons, or demyelination. Nervous system lesions which may be treated in a patient (including 
human and non-human mammalian patients) according to the invention include but are not 
limited to the following lesions of either the central (including spinal cord, brain) or peripheral 
nervous systems: 

(i) traumatic lesions, including lesions caused by physical injury or associated with 

1 5 surgery, for example, lesions that sever a portion of the nervous system, or compression injuries; 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system 
results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 
infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 

20 injured as a result of infection, for example, by an abscess or associated with infection by human 
immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, 
tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or 
injured as a result of a degenerative process including but not limited to degeneration associated 

25 with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral 
sclerosis; 

(v) lesions associated with nutritional diseases or disorders, in which a portion of the 
nervous system is destroyed or injured by a nutritional disorder or disorder of metabolism 
including but not limited to, vitamin B12 deficiency, folic acid deficiency, Wernicke disease, 

30 tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary degeneration of the corpus 
callosum), and alcoholic cerebellar degeneration; 

(vi) neurological lesions associated with systemic diseases including but not limited to 
diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, carcinoma, or 
sarcoidosis; 

61 



WO 01/74836 PCT/US01/10472 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 
neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is destroyed or 
injured by a demyelinating disease including but not limited to multiple sclerosis, human 

5 immunodeficiency virus-associated myelopathy, transverse myelopathy or various etiologies, 
progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 

Therapeutics which are useful according to the invention for treatment of a nervous 
system disorder may be selected by testing for biological activity in promoting the survival or 
differentiation of neurons. For example, and not by way of limitation, therapeutics which elicit 
10 any of the following effects may be useful according to the invention: 

(i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, e.g., 
choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 

1 5 (iv) decreased symptoms of neuron dysfunction in vivo. 

Such effects may be measured by any method known in the art. In preferred, 
non-limiting embodiments, increased survival of neurons may be measured by the method set 
forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may 
be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or Brown et al. 

20 (1981, Ann. Rev. Neurosci. 4:17-42); increased production of neuron-associated molecules may 
be measured by bioassay, enzymatic assay, antibody binding, Northern blot assay, etc., 
depending on the molecule to be measured; and motor neuron dysfunction may be measured by 
assessing the physical manifestation of motor neuron disorder, e.g., weakness, motor neuron 
conduction velocity, or functional disability. 

25 In specific embodiments, motor neuron disorders that may be treated according to the 

invention include but are not limited to disorders such as infarction, infection, exposure to toxin, 
trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as 
well as other components of the nervous system, as well as disorders that selectively affect 
neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal 

30 muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile 
muscular atrophy, progressive bulbar paralysis of childhood (Fazio-Londe syndrome), 
poliomyelitis and the post polio syndrome, and Hereditary Motorsensory Neuropathy 
(Charcot-Marie-Tooth Disease). 
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3.7.18 OTHER ACTIVITIES 

A polypeptide of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or function of, or killing, infectious agents, 
including, without limitation, bacteria, viruses, fungi and other parasites; effecting (suppressing 
5 or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye 
color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape 
(such as, for example, breast augmentation or diminution, change in bone form or shape); 
effecting biorhythms or circadian cycles or rhythms; effecting the fertility of male or female 
subjects; effecting the metabolism, catabolism, anabolism, processing, utilization, storage or 

1 0 elimination of dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other 
nutritional factors or component(s); effecting behavioral characteristics, including, without 
limitation, appetite, libido, stress, cognition (including cognitive disorders), depression 
(including depressive disorders) and violent behaviors; providing analgesic effects or other pain 
reducing effects; promoting differentiation and growth of embryonic stem cells in lineages other 

15 than hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, correcting 
deficiencies of the enzyme and treating deficiency-related diseases; treatment of 
hyperproliferative disorders (such as, for example, psoriasis); immunoglobulin-like activity (such 
as, for example, the ability to bind antigens or complement); and the ability to act as an antigen 
in a vaccine composition to raise an immune response against such protein or another material or 

20 entity which is cross-reactive with such protein. 

3.7.19 IDENTIFICATION OF POLYMORPHISMS 

The demonstration of polymorphisms makes possible the identification of such 
polymorphisms in human subjects and the pharmacogenetic use of this information for diagnosis 

25 and treatment. Such polymorphisms may be associated with, e.g., differential predisposition or 
susceptibility to various disease states (such as disorders involving inflammation or immune 
response) or a differential response to drug administration, and this genetic information can be 
used to tailor preventive or therapeutic treatment appropriately. For example, the existence of a 
polymorphism associated with a predisposition to inflammation or autoimmune disease makes 

30 possible the diagnosis of this condition in humans by identifying the presence of the 
polymorphism. 

Polymorphisms can be identified in a variety of ways known in the art which all 
generally involve obtaining a sample from a patient, analyzing DNA from the sample, optionally 
involving isolation or amplification of the DNA, and identifying the presence of the 
35 polymorphism in the DNA. For example, PCR may be used to amplify an appropriate fragment 
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of genomic DNA, which may then be sequenced. Alternatively, the DNA may be subjected to 
allele-specific oligonucleotide hybridization (in which appropriate oligonucleotides are 
hybridized to the DNA under conditions permitting detection of a single base mismatch) or to a 
single nucleotide extension assay (in which an oligonucleotide that hybridizes immediately 
5 adjacent to the position of the polymorphism is extended with one or more labeled nucleotides). 
In addition, traditional restriction fragment length polymorphism analysis (using restriction 
enzymes that provide differential digestion of the genomic DNA depending on the presence or 
absence of the polymorphism) may be performed. Arrays with nucleotide sequences of the 
present invention can be used to detect polymorphisms. The array can comprise modified 

10 nucleotide sequences of the present invention in order to detect the nucleotide sequences of the 
present invention. In the alternative, any one of the nucleotide sequences of the present 
invention can be placed on the array to detect changes from those sequences. 

Alternatively a polymorphism resulting in a change in the amino acid sequence could 
also be detected by detecting a corresponding change in amino acid sequence of the protein, e.g., 

15 by an antibody specific to the variant sequence. 

3.7.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against rheumatoid 
arthritis are determined in an experimental animal model system. The experimental model 

20 system is adjuvant induced arthritis in rats, and the protocol is described by J. Holoshitz, et al., 
1983, Science, 219:56, or by B. Waksman et al., 1963, Int. Arch. Allergy Appl. Immunol., 
23:129. Induction of the disease can be caused by a single injection, generally intradermally, of a 
suspension of killed Mycobacterium tuberculosis in complete Freund's adjuvant (CFA). The 
route of injection can vary, but rats may be injected at the base of the tail with an adjuvant 

25 mixture. The polypeptide is administered in phosphate buffered solution (PBS) at a dose of 
about 1-5 mg/kg. The control consists of administering PBS only. 

The procedure for testing the effects of the test compound would consist of intradermally 
injecting killed Mycobacterium tuberculosis in CFA followed by immediately administering the 
test compound and subsequent treatment every other day until day 24. At 14, 15, 18, 20, 22, and 

30 24 days after injection of Mycobacterium CFA, an overall arthritis score may be obtained as 
described by J. Holoskitz above. An analysis of the data would reveal that the test compound 
would have a dramatic affect on the swelling of the joints as measured by a decrease of the 
arthritis score. 

35 3.8 THERAPEUTIC METHODS 
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The compositions (including polypeptide fragments, analogs, variants and antibodies or 

other binding partners or modulators including antisense polynucleotides) of the invention have 

numerous applications in a variety of therapeutic methods. Examples of therapeutic applications 

include, but are not limited to, those exemplified herein. 

5 

3.8.1 EXAMPLE 

One embodiment of the invention is the administration of an effective amount of the 
polypeptides or other composition of the invention to individuals affected by a disease or 
disorder that can be modulated by regulating the peptides of the invention. While the mode of 

10 administration is not particularly important, parenteral administration is preferred. An 
exemplary mode of administration is to deliver an intravenous bolus. The dosage of the 
polypeptides or other composition of the invention will normally be determined by the 
prescribing physician. It is to be expected that the dosage will vary according to the age, weight, 
condition and response of the individual patient. Typically, the amount of polypeptide 

15 administered per dose will be in the range of about O.Oljig/kg to 100 mg/kg of body weight, with 
the preferred dose being about 0.1 jug/kg to 1 0 mg/kg of patient body weight. For parenteral 
administration, polypeptides of the invention will be formulated in an injectable form combined 
with a pharmaceutically acceptable parenteral vehicle. Such vehicles are well known in the art 
and examples include water, saline, Ringer's solution, dextrose solution, and solutions consisting 

20 of small amounts of the human serum albumin. The vehicle may contain minor amounts of 

additives that maintain the isotonicity and stability of the polypeptide or other active ingredient. 
The preparation of such solutions is within the skill of the art. 

3.9 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 
25 ADMINISTRATION 

A protein or other composition of the present invention (from whatever source derived, 
including without limitation from recombinant and non-recombinant sources and including 
antibodies and other binding partners of the polypeptides of the invention) may be administered 
to a patient in need, by itself, or in pharmaceutical compositions where it is mixed with suitable 

30 carriers or excipient(s) at doses to treat or ameliorate a variety of disorders. Such a composition 
may optionally contain (in addition to protein or other active ingredient and a carrier) diluents, 
fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 
"pharmaceutically acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active ingredient(s). The characteristics of the 

35 earner will depend on the route of administration. The pharmaceutical composition of the 
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invention may also contain cytokines, lymphokines, or other hematopoietic factors such as 
M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-1 1, IL-12, 
IL-13, IL-14, IL-1 5, IFN, TNFO, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell 
factor, and erythropoietin. In further compositions, proteins of the invention may be combined 
with other agents beneficial to the treatment of the disease or disorder in question. These agents 
include various growth factors such as epidermal growth factor (EGF), platelet-derived growth 
factor (PDGF), transforming growth factors (TGF-a and TGF-P), insulin-like growth factor 
(IGF), as well as cytokines described herein. 

The pharmaceutical composition may further contain other agents that either enhance the 
activity of the protein or other active ingredient or complement its activity or use in treatment. 
Such additional factors and/or agents may be included in the pharmaceutical composition to 
produce a synergistic effect with protein or other active ingredient of the invention, or to 
minimize side effects. Conversely, protein or other active ingredient of the present invention 
may be included in formulations of the particular clotting factor, cytokine, lymphokine, other 
hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti- inflammatory agent to 
minimize side effects of the clotting factor, cytokine, lymphokine, other hematopoietic factor, 
thrombolytic or anti-thrombotic factor, or anti-inflammatory agent (such as IL-IRa, IL-1 Hyl, 
IL-1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents). A protein of the present 
invention may be active in multimers (e.g., heterodimers or homodimers) or complexes with 
itself or other proteins. As a result, pharmaceutical compositions of the invention may comprise 
a protein of the invention in such multimeric or complexed form. 

As an alternative to being included in a pharmaceutical composition of the invention 
including a first protein, a second protein or a therapeutic agent may be concurrently 
administered with the first protein (e.g., at the same time, or at differing times provided that 
therapeutic concentrations of the combination of agents is achieved at the treatment site). 
Techniques for formulation and administration of the compounds of the instant application may 
be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest 
edition. A therapeutically effective dose further refers to that amount of the compound sufficient 
to result in amelioration of symptoms, e.g., treatment, healing, prevention or amelioration of the 
relevant medical condition, or an increase in rate of treatment, healing, prevention or 
amelioration of such conditions. When applied to an individual active ingredient, administered 
alone, a therapeutically effective dose refers to that ingredient alone. When applied to a 
combination, a therapeutically effective dose refers to combined amounts of the active 
ingredients that result in the therapeutic effect, whether administered in combination, serially or 
simultaneously. 
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In practicing the method of treatment or use of the present invention, a therapeutically 

effective amount of protein or other active ingredient of the present invention is administered to 

a mammal having a condition to be treated. Protein or other active ingredient of the present 

invention may be administered in accordance with the method of the invention either alone or in 

5 combination with other therapies such as treatments employing cytokines, lymphokines or other 

hematopoietic factors. When co- administered with one or more cytokines, lymphokines or other 

hematopoietic factors, protein or other active ingredient of the present invention may be 

administered either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic 

factor(s), thrombolytic or antithrombotic factors, or sequentially. If administered sequentially, 

10 the attending physician will decide on the appropriate sequence of administering protein or other 

active ingredient of the present invention in combination with cytokine(s), lymphokine(s), other 

hematopoietic factor(s), thrombolytic or antithrombotic factors. 

3.9.1 ROUTES OF ADMINISTRATION 

15 Suitable routes of administration may, for example, include oral, rectal, transmucosal, or 

intestinal administration; parenteral delivery, including intramuscular, subcutaneous, 
intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, 
intraperitoneal, intranasal, or intraocular injections. Administration of protein or other active 
ingredient of the present invention used in the pharmaceutical composition or to practice the 

20 method of the present invention can be carried out in a variety of conventional ways, such as oral 
ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, parenteral 
or intravenous injection. Intravenous administration to the patient is preferred. 

Alternately, one may administer the compound in a local rather than systemic manner, for 
example, via injection of the compound directly into arthritic joints or in fibrotic tissue, often in 

25 a depot or sustained release formulation. In order to prevent the scarring process frequently 
occurring as complication of glaucoma surgery, the compounds may be administered topically, 
for example, as eye drops. Furthermore, one may administer the drug in a targeted drug delivery 
system, for example, in a liposome coated with a specific antibody, targeting, for example, 
arthritic or fibrotic tissue. The liposomes will be targeted to and taken up selectively by the 

30 afflicted tissue. 

The polypeptides of the invention are administered by any route that delivers an effective 
dosage to the desired site of action. The determination of a suitable route of administration and 
an effective dosage for a particular indication is within the level of skill in the art. Preferably for 
wound treatment, one administers the therapeutic compound directly to the site. Suitable dosage 
35 ranges for the polypeptides of the invention can be extrapolated from these dosages or from 

67 



WO 01/74836 PCT/US01/10472 
similar studies in appropriate animal models. Dosages can then be adjusted as necessary by the 
clinician to provide maximal therapeutic benefit. 



3.9.2 COMPOSITIONS/FORMULATIONS 

5 Pharmaceutical compositions for use in accordance with the present invention thus may 

be formulated in a conventional manner using one or more physiologically acceptable carriers 
comprising excipients and auxiliaries which facilitate processing of the active compounds into 
preparations that can be used pharmaceutical^. These pharmaceutical compositions may be 
manufactured in a maimer that is itself known, e.g., by means of conventional mixing, 

10 dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or 
lyophilizing processes. Proper formulation is dependent upon the route of administration 
chosen. When a therapeutically effective amount of protein or other active ingredient of the 
present invention is administered orally, protein or other active ingredient of the present 
invention will be in the form of a tablet, capsule, powder, solution or elixir. When administered 

15 in tablet form, the pharmaceutical composition of the invention may additionally contain a solid 
carrier such as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to 
95% protein or other active ingredient of the present invention, and preferably from about 25 to 
90% protein or other active ingredient of the present invention. When administered in liquid 
form, a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil, 

20 mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the 
pharmaceutical composition may further contain physiological saline solution, dextrose or other 
saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. 
When administered in liquid form, the pharmaceutical composition contains from about 0.5 to 
90% by weight of protein or other active ingredient of the present invention, and preferably from 

25 about 1 to 50% protein or other active ingredient of the present invention. 

When a therapeutically effective amount of protein or other active ingredient of the 
present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or 
other active ingredient of the present invention will be in the form of a pyrogen-free, parenterally 
acceptable aqueous solution. The preparation of such parenterally acceptable protein or other 

30 active ingredient solutions, having due regard to pH, isotonicity, stability, and the like, is within 
the skill in the art. A preferred pharmaceutical composition for intravenous, cutaneous, or 
subcutaneous injection should contain, in addition to protein or other active ingredient of the 
present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, 
Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or 

35 other vehicle as known in the art. The pharmaceutical composition of the present invention may 
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also contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of 

skill in the art. For injection, the agents of the invention may be formulated in aqueous 

solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's 

solution, or physiological saline buffer. For transmucosal administration, penetrants appropriate 

to the barrier to be permeated are used in the formulation. Such penetrants are generally known 

in the art. 

For oral administration, the compounds can be formulated readily by combining the 
active compounds with pharmaceutically acceptable carriers well known in the art. Such carriers 
enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules, 
liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be 
treated. Pharmaceutical preparations for oral use can be obtained from a solid excipient, 
optionally grinding a resulting mixture, and processing the mixture of granules, after adding 
suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in 
particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose 
preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, 
gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium 
carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, disintegrating agents 
may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt 
thereof such as sodium alginate. Dragee cores are provided with suitable coatings. For this 
purpose, concentrated sugar solutions may be used, which may optionally contain gum arabic, 
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 
solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be 
added to the tablets or dragee coatings for identification or to characterize different combinations 
of active compound doses. 

Pharmaceutical preparations that can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 
sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as 
lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in 
suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 
stabilizers may be added. All formulations for oral administration should be in dosages suitable 
for such administration. For buccal administration, the compositions may take the form of 
tablets or lozenges formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
invention are conveniently delivered in the form of an aerosol spray presentation from 
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pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., 

dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or 

other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined by 

providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin for use 

in an inhaler or insufflator may be formulated containing a powder mix of the compound and a 

suitable powder base such as lactose or starch. The compounds may be formulated for parenteral 

administration by injection, e.g., by bolus injection or continuous infusion. Formulations for 

injection may be presented in unit dosage form, e.g., in ampules or in multi-dose containers, with 

an added preservative. The compositions may take such forms as suspensions, solutions or 

emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, 

stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions of 
the active compounds in water-soluble form. Additionally, suspensions of the active compounds 
may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or 
vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or 
triglycerides, or liposomes. Aqueous injection suspensions may contain substances that increase 
the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. 
Optionally, the suspension may also contain suitable stabilizers or agents that increase the 
solubility of the compounds to allow for the preparation of highly concentrated solutions. 
Alternatively, the active ingredient may be in powder form for constitution with a suitable 
vehicle, e.g., sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal compositions such as suppositories or 
retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other 
glycerides. In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt. 

A pharmaceutical carrier for the hydrophobic compounds of the invention is a co-solvent 
system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, and 
an aqueous phase. The co-solvent system may be the VPD co-solvent system. VPD is a solution 
of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, and 65% w/v 
polyethylene glycol 300, made up to volume in absolute ethanol. The VPD co-solvent system 
(VPD:5W) consists of VPD diluted 1:1 with 5% dextrose in water solution. This co-solvent 
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system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic 

administration. Naturally, the proportions of a co-solvent system may be varied considerably 

without destroying its solubility and toxicity characteristics. Furthermore, the identity of the 

co-solvent components may be varied: for example, other low-toxicity nonpolar surfactants may 

5 be used instead of polysorbate 80; the fraction size of polyethylene glycol may be varied; other 

biocompatible polymers may replace polyethylene glycol, e.g. polyvinyl pyrrolidone; and other 

sugars or polysaccharides may substitute for dextrose. Alternatively, other delivery systems for 

hydrophobic pharmaceutical compounds may be employed. Liposomes and emulsions are well 

known examples of delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents 

1 0 such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. 
Additionally, the compounds may be delivered using a sustained-release system, such as 
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. 
Various types of sustained-release materials have been established and are well known by those 
skilled in the art. Sustained-release capsules may, depending on their chemical nature, release the 

1 5 compounds for a few weeks up to over 1 00 days. Depending on the chemical nature and the 
biological stability of the therapeutic reagent, additional strategies for protein or other active 
ingredient stabilization may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase carriers 
or excipients. Examples of such carriers or excipients include but are not limited to calcium 

20 carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and 

polymers such as polyethylene glycols. Many of the active ingredients of the invention may be 
provided as salts with pharmaceutical^ compatible counter ions. Such pharmaceutical^ 
acceptable base addition salts are those salts which retain the biological effectiveness and 
properties of the free acids and which are obtained by reaction with inorganic or organic bases 

25 such as sodium hydroxide, magnesium hydroxide, ammonia, trialkylamine, dialkylamine, 

monoalkylamine, dibasic amino acids, sodium acetate, potassium benzoate, triethanol amine and 
the like. 

The pharmaceutical composition of the invention may be in the form of a complex of the 
protein(s) or other active ingredient(s) of present invention along with protein or peptide 

30 antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T 
lymphocytes. B-Iymphocytes will respond to antigen through their surface immunoglobulin 
receptor. T-lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins including 
those encoded by class I and class II MHC genes on host cells will serve to present the peptide 

35 antigen(s) to T lymphocytes. The antigen components could also be supplied as purified 
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MHC-peptide complexes alone or with co-stimulatory molecules that can directly signal T cells. 

Alternatively antibodies able to bind surface immunoglobulin and other molecules on B cells as 

well as antibodies able to bind the TCR and other molecules on T cells can be combined with the 

pharmaceutical composition of the invention. 

5 The pharmaceutical composition of the invention may be in the form of a liposome in 

which protein of the present invention is combined, in addition to other pharmaceutically 

acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as 

micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. Suitable 

lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides, 

10 sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like. Preparation of such 

liposomal formulations is within the level of skill in the art, as disclosed, for example, in U.S. 

Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of which are incorporated herein 

by reference. 

The amount of protein or other active ingredient of the present invention in the 

15 pharmaceutical composition of the present invention will depend upon the nature and severity of 
the condition being treated, and on the nature of prior treatments that the patient has undergone. 
Ultimately, the attending physician will decide the amount of protein or other active ingredient 
of the present invention with which to treat each individual patient. Initially, the attending 
physician will administer low doses of protein or other active ingredient of the present invention 

20 and observe the patient's response. Larger doses of protein or other active ingredient of the 
present invention may be administered until the optimal therapeutic effect is obtained for the 
patient, and at that point the dosage is not increased further. It is contemplated that the various 
pharmaceutical compositions used to practice the method of the present invention should contain 
about 0.01 |ig to about 100 mg (preferably about 0.1 fig to about 10 mg, more preferably about 

25 0.1 ng to about 1 mg) of protein or other active ingredient of the present invention per kg body 
weight. For compositions of the present invention that are useful for bone, cartilage, tendon or 
ligament regeneration, the therapeutic method includes administering the composition topically, 
systematically, or locally as an implant or device. When administered, the therapeutic 
composition for use in this invention is, of course, in a pyrogen-free, physiologically acceptable 

30 form. Further, the composition may desirably be encapsulated or injected in a viscous form for 
delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable 
for wound healing and tissue repair. Therapeutically useful agents other than a protein or other 
active ingredient of the invention that may also optionally be included in the composition as 
described above, may alternatively or additionally, be administered simultaneously or 

35 sequentially with the composition in the methods of the invention. Preferably for bone and/or 
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cartilage formation, the composition would include a matrix capable of delivering the 
protein-containing or other active ingredient-containing composition to the site of bone and/or 
cartilage damage, providing a structure for the developing bone and cartilage and optimally 
capable of being resorbed into the body. Such matrices may be formed of materials presently in 
5 use for other implanted medical applications. 

The choice of matrix material is based on biocompatibility, biodegradability, mechanical 
properties, cosmetic appearance and interface properties. The particular application of the 
compositions will define the appropriate formulation. Potential matrices for the compositions 
may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate, 

10 hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides. Other potential materials 
are biodegradable and biologically well-defined, such as bone or dermal collagen. Further 
matrices are comprised of pure proteins or extracellular matrix components. Other potential 
matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, 
aluminates, or other ceramics. Matrices may be comprised of combinations of any of the above- 

15 mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalcium phosphate. The bioceramics may be altered in composition, such as in 
calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and 
glycolic acid in the form of porous particles having diameters ranging from 150 to 800 microns. 

20 In some applications, it will be useful to utilize a sequestering agent, such as carboxymethyl 

cellulose or autologous blood clot, to prevent the protein compositions from disassociating from 
the matrix. 

A preferred family of sequestering agents is cellulosic materials such as alkylcelluloses 
(including hydroxyalkylcelluloses), including methylcellulose, ethylcellulose, 

25 hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl-methylcellulose, and 

carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose 
(CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate, poly 
(ethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and poly (vinyl alcohol). The 
amount of sequestering agent useful herein is 0.5-20 wt %, preferably 1-1 0 wt % based on total 

30 formulation weight, which represents the amount necessary to prevent desorption of the protein 
from the polymer matrix and to provide appropriate handling of the composition, yet not so 
much that the progenitor cells are prevented from infiltrating the matrix, thereby providing the 
protein the opportunity to assist the osteogenic activity of the progenitor cells. In further 
compositions, proteins or other active ingredients of the invention may be combined with other 

35 agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in 
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question. These agents include various growth factors such as epidermal growth factor (EGF), 

platelet derived growth factor (PDGF), transforming growth factors (TGF-a and TGF-p), and 

insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary applications. 

5 Particularly domestic animals and thoroughbred horses, in addition to humans, are desired 

patients for such treatment with proteins or other active ingredients of the present invention. The 

dosage regimen of a protein-containing pharmaceutical composition to be used in tissue 

regeneration will be determined by the attending physician considering various factors which 

modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site of 

10 damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (eg., 
bone), the patient's age, sex, and diet, the severity of any infection, time of administration and 
other clinical factors. The dosage may vary with the type of matrix used in the reconstitution 
and with inclusion of other proteins in the pharmaceutical composition. For example, the 
addition of other known growth factors, such as IGF I (insulin like growth factor I), to the final 

1 5 composition, may also affect the dosage. Progress can be monitored by periodic assessment of 
tissue/bone growth and/or repair, for example, X-rays, histomorphometric determinations and 
tetracycline labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 

20 mammalian subject. Polynucleotides of the invention may also be administered by other known 
methods for introduction of nucleic acid into a cell or organism (including, without limitation, in 
the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 

25 

3.9.3 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve its 
intended purpose. More specifically, a therapeutically effective amount means an amount 

30 effective to prevent development of or to alleviate the existing symptoms of the subject being 
treated. Determination of the effective amount is well within the capability of those skilled in 
the art, especially in light of the detailed disclosure provided herein. For any compound used in 
the method of the invention, the therapeutically effective dose can be estimated initially from 
appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a 

35 circulating concentration range that can be used to more accurately determine useful doses in 
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humans. For example, a dose can be formulated in animal models to achieve a circulating 

concentration range that includes the IC 50 as determined in cell culture (i.e., the concentration of 

the test compound which achieves a half-maximal inhibition of the protein's biological activity). 

Such information can be used to more accurately determine useful doses in humans. 

5 A therapeutically effective dose refers to that amount of the compound that results in 

amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic 

efficacy of such compounds can be determined by standard pharmaceutical procedures in cell 

cultures or experimental animals, e.g., for determining the LD 50 (the dose lethal to 50% of the 

population) and the ED 50 (the dose therapeutically effective in 50% of the population). The dose 

1 0 ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the 
ratio between LD50 and ED50. Compounds that exhibit high therapeutic indices are preferred. 
The data obtained from these cell culture assays and animal studies can be used in formulating a 
range of dosage for use in human. The dosage of such compounds lies preferably within a range 
of circulating concentrations that include the ED 50 with little or no toxicity. The dosage may 

1 5 vary within this range depending upon the dosage form employed and the route of administration 
utilized. The exact formulation, route of administration and dosage can be chosen by the 
individual physician in view of the patient's condition. See, e.g., Fingl et al., 1975, in "The 
Pharmacological Basis of Therapeutics", Ch. 1 p.l . Dosage amount and interval may be adjusted 
individually to provide plasma levels of the active moiety that are sufficient to maintain the 

20 desired effects, or minimal effective concentration (MEC). The MEC will vary for each 

compound but can be estimated from in vitro data. Dosages necessary to achieve the MEC will 
depend on individual characteristics and route of administration. However, HPLC assays or 
bioassays can be used to determine plasma concentrations. 

Dosage intervals can also be determined using MEC value. Compounds should be 

25 administered using a regimen that maintains plasma levels above the MEC for 1 0-90% of the 
time, preferably between 30-90% and most preferably between 50-90%. In cases of local 
administration or selective uptake, the effective local concentration of the drug may not be 
related to plasma concentration. 

An exemplary dosage regimen for polypeptides or other compositions of the invention 

30 will be in the range of about 0.01 ng/kg to 100 mg/kg of body weight daily, with the preferred 
dose being about 0.1 ^ig/kg to 25 mg/kg of patient body weight daily, varying in adults and 
children. Dosing may be once daily, or equivalent doses may be delivered at longer or shorter 
intervals. 
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The amount of composition administered will, of course, be dependent on the subject 
being treated, on the subject's age and weight, the severity of the affliction, the manner of 
administration and the judgment of the prescribing physician. 

3.9.4 PACKAGING 

The compositions may, if desired, be presented in a pack or dispenser device that may 
contain one or more unit dosage forms containing the active ingredient. The pack may, for 
example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may 
be accompanied by instructions for administration. Compositions comprising a compound of the 
invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an 
appropriate container, and labeled for treatment of an indicated condition. 

3.10 ANTIBODIES 

Also included in the invention are antibodies to proteins, or fragments of proteins of the 
invention. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that contain 
an antigen binding site that specifically binds (immunoreacts with) an antigen. Such antibodies 
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, F a t>, F ab ' and F^ 2 
fragments, and an F ab expression library. In general, an antibody molecule obtained from 
humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ from one another 
by the nature of the heavy chain present in the molecule. Certain classes have subclasses as well, 
such as IgGi, IgG 2 , and others. Furthermore, in humans, the light chain may be a kappa chain or 
a lambda chain. Reference herein to antibodies includes a reference to all such classes, 
subclasses and types of human antibody species. 

An isolated related protein of the invention may be intended to serve as an antigen, or a 
portion or fragment thereof, and additionally can be used as an immunogen to generate 
antibodies that immunospecifically bind the antigen, using standard techniques for polyclonal 
and monoclonal antibody preparation. The full-length protein can be used or, alternatively, the 
invention provides antigenic peptide fragments of the antigen for use as immunogens. An 
antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid sequence 
of the full length protein, such as an amino acid sequence shown in SEQ ID NO: 85-167, or 252- 
335, and encompasses an epitope thereof such that an antibody raised against the peptide forms a 
specific immune complex with the full length protein or with any fragment that contains the 
epitope. Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least 
1 5 amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. 
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Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are 

located on its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 

antigenic peptide is a region of the protein that is located on the surface of the protein, e.g., a 
5 hydrophilic region. A hydrophobicity analysis of the human related protein sequence will 

indicate which regions of a related protein are particularly hydrophilic and, therefore, are likely 

to encode surface residues useful for targeting antibody production. As a means for targeting 

antibody production, hydropathy plots showing regions of hydrophilicity and hydrophobicity 

may be generated by any method well known in the art, including, for example, the Kyte 
10 Doolittle or the Hopp Woods methods, either with or without Fourier transformation. See, e.g., 

Hopp and Woods, 1981, Proc. Nat. Acad Sci. USA 78: 3824-3828; Kyte and Doolittle 1982, J. 

Mol Biol. 157: 105-142, each of which is incorporated herein by reference in its entirety. 

Antibodies that are specific for one or more domains within an antigenic protein, or derivatives, 

fragments, analogs or homologs thereof, are also provided herein. 
15 A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 

thereof, may be utilized as an immunogen in the generation of antibodies that 

immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of polyclonal or 

monoclonal antibodies directed against a protein of the invention, or against derivatives, 
20 fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: A Laboratory 

Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, Cold Spring 

Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below. 



3.10.1 POLYCLONAL ANTIBODIES 

25 For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit, 

goat, mouse or other mammal) may be immunized by one or more injections with the native 
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 

30 recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated to 
a second protein known to be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, 
bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can further include an 
adjuvant. Various adjuvants used to increase the immunological response include, but are not 

35 limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface 
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active substances (e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 

dinitrophenol, etc.), adjuvants usable in humans such as Bacille Calmette-Guerin and 

Corynebacterium parvum, or similar immunostimulatory agents. Additional examples of 

adjuvants which can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A, 

synthetic trehalose dicorynomycolate). 

The polyclonal antibody molecules directed against the immunogenic protein can be 

isolated from the mammal (e.g., from the blood) and further purified by well known techniques, 

such as affinity chromatography using protein A or protein G, which provide primarily the IgG 

fraction of immune serum. Subsequently, or alternatively, the specific antigen which is the 

target of the immunoglobulin sought, or an epitope thereof, may be immobilized on a column to 

purify the immune specific antibody by immunoaffinity chromatography. Purification of 

immunoglobulins is discussed, for example, by D. Wilkinson (The Scientist, published by The 

Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 2000), pp. 25-28). 

3.10.2 MONOCLONAL ANTIBODIES 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as used 
herein, refers to a population of antibody molecules that contain only one molecular species of 
antibody molecule consisting of a unique light chain gene product and a unique heavy chain gene 
product. In particular, the complementarity determining regions (CDRs) of the monoclonal 
antibody are identical in all the molecules of the population. MAbs thus contain an antigen 
binding site capable of immunoreacting with a particular epitope of the antigen characterized by 
a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature, 256:495 (1 975). In a hybridoma method, a mouse, 
hamster, or other appropriate host animal, is typically immunized with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind 
to the immunizing agent. Alternatively, the lymphocytes can be immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof or a 
fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of human 
origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources 
are desired. The lymphocytes are then fused with an immortalized cell line using a suitable 
fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal 
Antibodies: Principles and Practice. Academic Press, (1986) pp. 59-103). Immortalized cell 
lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and 
human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells can 
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be cultured in a suitable culture medium that preferably contains one or more substances that 
inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental 
cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the 
culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and 
5 thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level 
expression of antibody by the selected antibody-producing cells, and are sensitive to a medium 
such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which 
can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, 
10 California and the American Type Culture Collection, Manassas, Virginia. Human myeloma and 
mouse-human heteromyeloma cell lines also have been described for the production of human 
monoclonal antibodies (Kozbor, J. Immunol.. 133:3001 (1984); Brodeur et aL, Monoclonal 
Antibody Production Techniques and Applications, Marcel Dekker, Inc., New York, (1987) pp. 
51-63). 

1 5 The culture medium in which the hybridoma cells are cultured can then be assayed for 

the presence of monoclonal antibodies directed against the antigen. Preferably, the binding 
specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the 

20 art. The binding affinity of the monoclonal antibody can, for example, be determined by the 
Scatchard analysis of Munson and Pollard, Anal. Biochemu 107:220 (1980). Preferably, 
antibodies having a high degree of specificity and a high binding affinity for the target antigen 
are isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by limiting 
25 dilution procedures and grown by standard methods. Suitable culture media for this purpose 
include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. 
Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from the 
culture medium or ascites fluid by conventional immunoglobulin purification procedures such 
30 as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, 
dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 
those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the 
invention can be readily isolated and sequenced using conventional procedures (e.g., by using 
35 oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
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light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred 
source of such DNA. Once isolated, the DNA can be placed into expression vectors, which are 
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or 
myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of 
5 monoclonal antibodies in the recombinant host cells. The DNA also can be modified, for 

example, by substituting the coding sequence for human heavy and light chain constant domains 
in place of the homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368, 
812-1 3 (1 994)) or by covalently joining to the immunoglobulin coding sequence all or part of the 
coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 
10 polypeptide can be substituted for the constant domains of an antibody of the invention, or can 
be substituted for the variable domains of one antigen-combining site of an antibody of the 
invention to create a chimeric bivalent antibody. 
3.103 HUMANIZED ANTIBODIES 

The antibodies directed against the protein antigens of the invention can further comprise 

1 5 humanized antibodies or human antibodies. These antibodies are suitable for administration to 
humans without engendering an immune response by the human against the administered 
immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen- 
binding subsequences of antibodies) that are principally comprised of the sequence of a human 

20 immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin. 
Humanization can be performed following the method of Winter and co-workers (Jones et al., 
Nature, 321 :522-525 (1986); Riechmann et al., Nature. 332:323-327 (1988); Verhoeyen et al., 
Science. 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some 

25 instances, Fv fraipework residues of the human immunoglobulin are replaced by corresponding 
non-human residues. Humanized antibodies can also comprise residues which are found neither 
in the recipient antibody nor in the imported CDR or framework sequences. In general, the 
humanized antibody will comprise substantially all of at least one, and typically two, variable 
domains, in which all or substantially all of the CDR regions correspond to those of a non-human 

30 immunoglobulin and all or substantially all of the framework regions are those of a human 

immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at 
least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. BioL 
2:593-596(1992)). 

35 
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3.10.4 HUMAN ANTIBODIES 

Fully human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from human 
genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein. 
Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell 
hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma 
technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: MONOCLONAL 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human monoclonal 
antibodies may be utilized in the practice of the present invention and may be produced by using 
human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or by 
transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1985 In: 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol.. 227:381 (1991); 
Marks et al, J. Mol. Biol., 222:581 (1991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in humans 
in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach 
is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 
5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 779-783 (1992)); Lonberg et al. 
(Nature 368 856-859 (1994)); Morrison ( Nature 368, 812-13 (1994)); Fishwild et al,( Nature 
Biotechnology 14, 845-51 (1996)); Neuberger (Nature Biotechnology 14, 826 (1996)); and 
Lonberg and Huszar (Intern. Rev. Immunol. 13 65-93 (1995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
which are modified so as to produce fully human antibodies rather than the animal's endogenous 
antibodies in response to challenge by an antigen. (See PCT publication WO94/02602). The 
endogenous genes encoding the heavy and light immunoglobulin chains in the nonhuman host 
have been incapacitated, and active loci encoding human heavy and light chain immunoglobulins 
are inserted into the host's genome. The human genes are incorporated, for example, using yeast 
artificial chromosomes containing the requisite human DNA segments. An animal which 
provides all the desired modifications is then obtained as progeny by crossbreeding intermediate 
transgenic animals containing fewer than the full complement of the modifications. The 
preferred embodiment of such a nonhuman animal is a mouse, and is termed the Xenomouse™ 
as disclosed in PCT publications WO 96/33735 and WO 96/34096. This animal produces B 
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cells which secrete fully human immunoglobulins. The antibodies can be obtained directly from 
the animal after immunization with an immunogen of interest, as, for example, a preparation of a 
polyclonal antibody, or alternatively from immortalized B cells derived from the animal, such as 
hybridomas producing monoclonal antibodies. Additionally, the genes encoding the 
5 immunoglobulins with human variable regions can be recovered and expressed to obtain the 
antibodies directly, or can be further modified to obtain analogs of antibodies such as, for 
example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, lacking 
expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent No. 

10 5,939,598. It can be obtained by a method including deleting the J segment genes from at least 
one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of the 
locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy chain locus, 
the deletion being effected by a targeting vector containing a gene encoding a selectable marker; 
and producing from the embryonic stem cell a transgenic mouse whose somatic and germ cells 

1 5 contain the gene encoding the selectable marker. 

A method for producing an antibody of interest, such as a human antibody, is disclosed in 
U.S. Patent No. 5,916,771. It includes introducing an expression vector that contains a 
nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, introducing 
an expression vector containing a nucleotide sequence encoding a light chain into another 

20 mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell expresses an 
antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically relevant 
epitope on an immunogen, and a correlative method for selecting an antibody that binds 
immunospecifically to the relevant epitope with high affinity, are disclosed in PCT publication 

25 WO 99/53049. 

3.10.5 F fl b FRAGMENTS AND SINGLE CHAIN ANTIBODIES 

According to the invention, techniques can be adapted for the production of single-chain 
antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent No. 4,946,778). 

30 In addition, methods can be adapted for the construction of F a b expression libraries (see e.g., 
Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective identification of 
monoclonal F ab fragments with the desired specificity for a protein or derivatives, fragments, 
analogs or homologs thereof. Antibody fragments that contain the idiotypes to a protein antigen 
may be produced by techniques known in the art including, but not limited to: (i) an F^ 

35 fragment produced by pepsin digestion of an antibody molecule; (ii) an F a b fragment generated 
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by reducing the disulfide bridges of an ¥ W2 fragment; (iii) an F a b fragment generated by the 
treatment of the antibody molecule with papain and a reducing agent and (iv) F v fragments. 



3.10.6 BISPECIFIC ANTIBODIES 

5 Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 

have binding specificities for at least two different antigens. In the present case, one of the 
binding specificities is for an antigenic protein of the invention. The second binding target is any 
other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit. 
Methods for making bispecific antibodies are known in the art. Traditionally, the 

10 recombinant production of bispecific antibodies is based on the co-expression of two 

immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature. 305:537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a 
potential mixture of ten different antibody molecules, of which only one has the correct 

1 5 bispecific structure. The purification of the correct molecule is usually accomplished by affinity 
chromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 
1993, and mTrauneckere/tf/., 1991 EMBO J., 10:3655-3659. 

Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 

20 preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of 
the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region 
(CHI) containing the site necessary for light-chain binding present in at least one of the fusions. 
DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin 
light chain, are inserted into separate expression vectors, and*are co-transfected into a suitable 

25 host organism. For further details of generating bispecific antibodies see, for example, Suresh et 
aL, Methods in Enzvmologv. 121:210 (1986). 

According to another approach described in WO 96/2701 1, the interface between a pair 
of antibody molecules can be engineered to maximize the percentage of heterodimers which are 
recovered from recombinant cell culture. The preferred interface comprises at least a part of the 

30 CH3 region of an antibody constant domain. In this method, one or more small amino acid side 
chains from the interface of the first antibody molecule are replaced with larger side chains (e.g. 
tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side 
chain(s) are created on the interface of the second antibody molecule by replacing large amino 
acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for 

35 increasing the yield of the heterodimer over other unwanted end-products such as homodimers. 
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Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. 
F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies from antibody 
fragments have been described in the literature. For example, bispecific antibodies can be 
prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a procedure 
wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments. These 
fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to 
stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments 
generated are then converted to fhionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 
derivatives is then reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is 
mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific 
antibody. The bispecific antibodies produced can be used as agents for the selective 
immobilization of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe 
the production of a fully humanized bispecific antibody F(ab')2 molecule. Each Fab' fragment 
was separately secreted from E. coli and subjected to directed chemical coupling in vitro to form 
the bispecific antibody. The bispecific antibody thus formed was able to bind to cells 
overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity 
of human cytotoxic lymphocytes against human breast tumor targets. 

Various techniques for making and isolating bispecific antibody fragments directly from 
recombinant cell culture have also been described. For example, bispecific antibodies have been 
produced using leucine zippers. Kostelny et al., J. Immunol. 148(5):1547-1553 (1992). The 
leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two 
different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region 
to form monomers and then re-oxidized to form the antibody heterodimers. This method can 
also be utilized for the production of antibody homodimers. The "diabody" technology 
described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an 
alternative mechanism for making bispecific antibody fragments. The fragments comprise a 
heavy-chain variable domain (V H ) connected to a light-chain variable domain (V L ) by a linker 
which is too short to allow pairing between the two domains on the same chain. Accordingly, 
the V H and V L domains of one fragment are forced to pair with the complementary V L and V H 
domains of another fragment, thereby forming two antigen-binding sites. Another strategy for 
making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been 
reported. See, Gruber et al., J. Immunol. 1 52:5368 (1994). 
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Antibodies with more than two valencies are contemplated. For example, trispecific 
antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1991). 
Exemplary bispecific antibodies can bind to two different epitopes, at least one of which 
originates in the protein antigen of the invention. Alternatively j an anti-antigenic arm of an 
5 immunoglobulin molecule can be combined with an arm which binds to a triggering molecule on 
a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc receptors for 
IgG(Fc R),suchasFc RI (CD64),Fc RII (CD32) and Fc RIII (CD16) so as to focus 
cellular defense mechanisms to the cell expressing the particular antigen. Bispecific antibodies 
can also be used to direct cytotoxic agents to cells which express a particular antigen. These 
1 0 antibodies possess an antigen-binding arm and an arm which binds a cytotoxic agent or a 

radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody 
of interest binds the protein antigen described herein and further binds tissue factor (TF). 

3.10.7 HETEROCONJUGATE ANTIBODIES 

1 5 Heteroconjugate antibodies are also within the scope of the present invention. 

Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies 
have, for example, been proposed to target immune system cells to unwanted cells (U.S. Patent 
No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 92/200373; EP 03089). 
It is contemplated that the antibodies can be prepared in vitro using known methods in synthetic 

20 protein chemistry, including those involving crosslinking agents. For example, immunotoxins 
can be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4- 
mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980. 

25 3.10.8 EFFECTOR FUNCTION ENGINEERING 

It can be desirable to modify the antibody of the invention with respect to effector 
function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing interchain 
disulfide bond formation in this region. The homodimeric antibody thus generated can have 
30 improved internalization capability and/or increased complement-mediated cell killing and 
antibody-dependent cellular cytotoxicity (ADCC). See Caron et al., J. Exp Med., 176: 1 191- 
1 195 (1992) and Shopes, J. Immunol., 148: 291 8-2922 (1992). Homodimeric antibodies with 
enhanced anti-tumor activity can also be prepared using heterobifunctional cross-linkers as 
described in Wolff et al. Cancer Research, 53: 2560-2565 (1993). Alternatively, an antibody can 
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be engineered that has dual Fc regions and can thereby have enhanced complement lysis and 
ADCC capabilities. See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989). 



3.10.9 IMMUNOCONJUGATES 

5 The invention also pertains to immunoconjugates comprising an antibody conjugated to a 

cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of 
bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., a 
radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been 

10 described above. Enzymatically active toxins and fragments thereof that can be used include 
diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from 
Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, 
Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and 
PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, 

15 mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of 

radionuclides are available for the production of radioconjugated antibodies. Examples include 
2,2 Bi, ,3I I, m In, 90 Y, and I86 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional 
protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), 

20 iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), 
active esters (such as disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido 
compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as 
bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), 
and bis-active fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). For example, a 

25 ricin immunotoxin can be prepared as described in Vitetta et al., Science, 238: 1098 (1987). 
Carbon- 14-labeled I-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX- 
DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See 
WO94/11026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 
30 streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
administered to the patient, followed by removal of unbound conjugate from the circulation 
using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn 
conjugated to a cytotoxic agent. 
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3.11 COMPUTER READABLE SEQUENCES 

In one application of this embodiment, a nucleotide sequence of the present invention can 
be recorded on computer readable media. As used herein, "computer readable media" refers to 
any medium that can be read and accessed directly by a computer. Such media include, but are 
5 not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and 
magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM 
and ROM; and hybrids of these categories such as magnetic/optical storage media. A skilled 
artisan can readily appreciate how any of the presently known computer readable mediums can 
be used to create a manufacture comprising computer readable medium having recorded thereon 

10 a nucleotide sequence of the present invention. As used herein, "recorded" refers to a process for 
storing information on computer readable medium. A skilled artisan can readily adopt any of the 
presently known methods for recording information on computer readable medium to generate 
manufactures comprising the nucleotide sequence information of the present invention. 
A variety of data storage structures are available to a skilled artisan for creating a 

1 5 computer readable medium having recorded thereon a nucleotide sequence of the present 

invention. The choice of the data storage structure will generally be based on the means chosen 
to access the stored information. In addition, a variety of data processor programs and formats 
can be used to store the nucleotide sequence information of the present invention on computer 
readable medium. The sequence information can be represented in a word processing text file, 

20 formatted in commercially-available software such as WordPerfect and Microsoft Word, or 

represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, 
Oracle, or the like. A skilled artisan can readily adapt any number of data processor structuring 
formats (e.g. text file or database) in order to obtain computer readable medium having recorded 
thereon the nucleotide sequence information of the present invention. 

25 By providing any of the nucleotide sequences SEQ ID NO: 1-84, or 168-251, or a 

representative fragment thereof; or a nucleotide sequence at least 95% identical to any of the 
nucleotide sequences of SEQ ID NO: 1-84, or 168-251 in computer readable form, a skilled 
artisan can routinely access the sequence information for a variety of purposes. Computer 
software is publicly available which allows a skilled artisan to access sequence information 

30 provided in a computer readable medium. The examples which follow demonstrate how 

software which implements the BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and 
BLAZE (Brutlag et al., Comp. Chem. 17:203-207 (1993)) search algorithms on a Sybase system 
is used to identify open reading frames (ORFs) within a nucleic acid sequence. Such ORFs may 
be protein-encoding fragments and may be useful in producing commercially important proteins 
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such as enzymes used in fermentation reactions and in the production of commercially useful 
metabolites. 

As used herein, "a computer-based system" refers to the hardware means, software 
means, and data storage means used to analyze the nucleotide sequence information of the 
5 present invention. The minimum hardware means of the computer-based systems of the present 
invention comprises a central processing unit (CPU), input means, output means, and data 
storage means. A skilled artisan can readily appreciate that any one of the currently available 
computer-based systems are suitable for use in the present invention. As stated above, the 
computer-based systems of the present invention comprise a data storage means having stored 

10 therein a nucleotide sequence of the present invention and the necessary hardware means and 
software means for supporting and implementing a search means. As used herein, "data storage 
means" refers to memory which can store nucleotide sequence information of the present 
invention, or a memory access means which can access manufactures having recorded thereon 
the nucleotide sequence information of the present invention. 

1 5 As used herein, "search means" refers to one or more programs that are implemented on 

the computer-based system to compare a target sequence or target structural motif with the 
sequence information stored within the data storage means. Search means are used to identify 
fragments or regions of a known sequence that match a particular target sequence or target motif. 
A variety of known algorithms are disclosed publicly and a variety of commercially available 

20 software for conducting search means are and can be used in the computer-based systems of the 
present invention. Examples of such software include, but are not limited to, Smith- Waterman, 
MacPattem (EMBL), BLASTN and BLASTA (NPOLYPEPTIDEIA). A skilled artisan can 
readily recognize that any one of the available algorithms or implementing software packages for 
conducting homology searches can be adapted for use in the present computer-based systems. 

25 As used herein, a "target sequence" can be any nucleic acid or amino acid sequence of six or 
more nucleotides or two or more amino acids. A skilled artisan can readily recognize that the 
longer a target sequence is, the less likely a target sequence will be present as a random 
occurrence in the database. The most preferred sequence length of a target sequence is from 
about 10 to 300 amino acids, more preferably from about 30 to 100 nucleotide residues. 

30 However, it is well recognized that searches for commercially important fragments, such as 
sequence fragments involved in gene expression and protein processing, may be of shorter 
length. 

As used herein, "a target structural motif," or "target motif," refers to any rationally 
selected sequence or combination of sequences in which the sequence(s) are chosen based on a 
35 three-dimensional configuration that is formed upon the folding of the target motif. There are a 
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variety of target motifs known in the art. Protein target motifs include, but are not limited to, 

enzyme active sites and signal sequences. Nucleic acid target motifs include, but are not limited 

to, promoter sequences, hairpin structures and inducible expression elements (protein binding 

sequences). 

5 

3.12 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be used to 
control gene expression through triple helix formation or antisense DNA or RNA, both of which 
methods are based on the binding of a polynucleotide sequence to DNA or RNA. 

10 Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and are 
designed to be complementary to a region of the gene involved in transcription (triple helix-see 
Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 15241 :456 (1988); and Dervan 
et al., Science 251 :1360 (1991)) or to the mRNA itself (antisense-Olmno, J. Neurochem. 56:560 
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 

1 5 Raton, FL (1 988)). Triple helix-formation optimally results in a shut-off of RNA transcription 
from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into 
polypeptide. Both techniques have been demonstrated to be effective in model systems. 
Information contained in the sequences of the present invention is necessary for the design of an 
antisense or triple helix oligonucleotide. 

20 

3.13 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression of 
one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic 
acid probe or antibodies of the present invention, optionally conjugated or otherwise associated 
25 with a suitable label. 

In general, methods for detecting a polynucleotide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the polynucleotide 
for a period sufficient to form the complex, and detecting the complex, so that if a complex is 
detected, a polynucleotide of the invention is detected in the sample. Such methods can also 
30 comprise contacting a sample under stringent hybridization conditions with nucleic acid primers 
that anneal to a polynucleotide of the invention under such conditions, and amplifying annealed 
polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is 
detected in the sample. 

In general, methods for detecting a polypeptide of the invention can comprise contacting 
35 a sample with a compound that binds to and forms a complex with the polypeptide for a period 
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sufficient to form the complex, and detecting the complex, so that if a complex is detected, a 

polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of the 

antibodies or one or more of the nucleic acid probes of the present invention and assaying for 

5 binding of the nucleic acid probes or antibodies to components within the test sample. 

Conditions for incubating a nucleic acid probe or antibody with a test sample vary. 

Incubation conditions depend on the format employed in the assay, the detection methods 

employed, and the type and nature of the nucleic acid probe or antibody used in the assay. One 

skilled in the art will recognize that any one of the commonly available hybridization, 

10 amplification or immunological assay formats can readily be adapted to employ the nucleic acid 
probes or antibodies of the present invention. Examples of such assays can be found in Chard, 
T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, 
Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in Immunocytochemistry, 
Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice 

1 5 and Theory of immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, 
Elsevier Science Publishers, Amsterdam, The Netherlands (1985). The test samples of the 
present invention include cells, protein or membrane extracts of cells, or biological fluids such as 
sputum, blood, serum, plasma, or urine. The test sample used in the above-described method 
will vary based on the assay format, nature of the detection method and the tissues, cells or 

20 extracts used as the sample to be assayed. Methods for preparing protein extracts or membrane 
extracts of cells are well known in the art and can be readily be adapted in order to obtain a 
sample which is compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 
necessary reagents to carry out the assays of the present invention. Specifically, the invention 

25 provides a compartment kit to receive, in close confinement, one or more containers which 
comprises: (a) a first container comprising one of the probes or antibodies of the present 
invention; and (b) one or more other containers comprising one or more of the following: wash 
reagents, reagents capable of detecting presence of a bound probe or antibody. 

In detail, a compartment kit includes any kit in which reagents are contained in separate 

30 containers. Such containers include small glass containers, plastic containers or strips of plastic 
or paper. Such containers allows one to efficiently transfer reagents from one compartment to 
another compartment such that the samples and reagents are not cross-contaminated, and the 
agents or solutions of each container can be added in a quantitative fashion from one 
compartment to another. Such containers will include a container, which will accept the test 

35 sample, a container, which contains the antibodies used in the assay, containers, which contain 
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wash reagents (such as phosphate buffered saline, Tris-buffers, etc.), and containers, which 
contain the reagents used to detect the bound antibody or probe. Types of detection reagents 
include labeled nucleic acid probes, labeled secondary antibodies, or in the alternative, if the 
primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of 
5 reacting with the labeled antibody. One skilled in the art will readily recognize that the disclosed 
probes and antibodies of the present invention can be readily incorporated into one of the 
established kit formats that are well known in the art. 

3.14 MEDICAL IMAGING 

10 The novel polypeptides and binding partners of the invention are useful in medical 

imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the 
invention is involved in the immune response, for imaging sites of inflammation or infection). 
See, e.g., Kunkel et al., U.S. Pat. No. 5,413,778. Such methods involve chemical attachment of a 
labeling or imaging agent, administration of the labeled polypeptide to a subject in a 

15 pharmaceutical^ acceptable carrier, and imaging the labeled polypeptide in vivo at the target 
site. 

3.15 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present invention 
20 further provides methods of obtaining and identifying agents which bind to a polypeptide set 
forth in SEQ ID NO: 85-167, or 252-335 encoded by an ORF corresponding to any of the 
nucleotide sequences set forth in SEQ ID NO: 1-84, or 168-251, or which binds to a specific 
domain of the polypeptide encoded by the nucleic acid. In detail, said method comprises the 
steps of: 

25 (a) contacting an agent with an isolated protein encoded by an ORF of the present 

invention, or nucleic acid of the invention; and 

(b) determining whether the agent binds to said protein or said nucleic acid. 
In general, therefore, such methods for identifying compounds that bind to a 
polynucleotide of the invention can comprise contacting a compound with a polynucleotide of 
30 the invention for a time sufficient to form a polynucleotide/compound complex, and detecting 
the complex, so that if a polynucleotide/compound complex is detected, a compound that binds 
to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind to a 
polypeptide of the invention can comprise contacting a compound with a polypeptide of the 
35 invention for a time sufficient to form a polypeptide/compound complex, and detecting the 
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complex, so that if a polypeptide/compound complex is detected, a compound that binds to a 
polynucleotide of the invention is identified. 

Methods for identifying compounds that bind to a polypeptide of the invention can also 
comprise contacting a compound with a polypeptide of the invention in a cell for a time 
5 sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a 
receptor gene sequence in the cell, and detecting the complex by detecting reporter gene 
sequence expression, so that if a polypeptide/compound complex is detected, a compound that 
binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds that modulate the 

10 activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to 

activity observed in the absence of the compound). Alternatively, compounds identified via such 
methods can include compounds that modulate the expression of a polynucleotide of the 
invention (that is, increase or decrease expression relative to expression levels observed in the 
absence of the compound). Compounds, such as compounds identified via the methods of the 

1 5 invention, can be tested using standard assays well known to those of skill in the art for their 
ability to modulate activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be selected 
and screened at random or rationally selected or designed using protein modeling techniques. 

20 For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and 

the like are selected at random and are assayed for their ability to bind to the protein encoded by 
the ORF of the present invention. Alternatively, agents may be rationally selected or designed. 
As used herein, an agent is said to be "rationally selected or designed" when the agent is chosen 
based on the configuration of the particular protein. For example, one skilled in the art can 

25 readily adapt currently available procedures to generate peptides, pharmaceutical agents and the 
like, capable of binding to a specific peptide sequence, in order to generate rationally designed 
antipeptide peptides, for example see Hurby et aL, Application of Synthetic Peptides: Antisense 
Peptides," In Synthetic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and 
Kaspczak et al., Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like. 

30 In addition to the foregoing, one class of agents of the present invention, as broadly 

described, can be used to control gene expression through binding to one of the ORFs or EMFs 
of the present invention. As described above, such agents can be randomly screened or 
rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design 
sequence specific or element specific agents, modulating the expression of either a single ORF or 

35 multiple ORFs that rely on the same EMF for expression control. One class of DNA binding 
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agents are agents which contain base residues which hybridize or form a triple helix formation 
by binding to DNA or RNA. Such agents can be based on the classic phosphodiester, 
ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric derivatives that have 
base attachment capacity. 
5 Agents suitable for use in these methods preferably contain 20 to 40 bases and are 

designed to be complementary to a region of the gene involved in transcription (triple helix-see 
Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et 
al., Science 251:1360 (1991)) or to the mRNA itself (antisense-Okano, J. Neurochem. 56:560 
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 

10 Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription 
from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into 
polypeptide. Both techniques have been demonstrated to be effective in model systems. 
Information contained in the sequences of the present invention is necessary for the design of an 
antisense or triple helix oligonucleotide and other DNA binding agents. 

1 5 Agents that bind to a protein encoded by one of the ORFs of the present invention can be 

used as a diagnostic agent. Agents that bind to a protein encoded by one of the ORFs of the 
present invention can be formulated using known techniques to generate a pharmaceutical 
composition. 

20 3.16 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid 
hybridization probes capable of hybridizing with naturally occurring nucleotide sequences. The 
hybridization probes of the subject invention may be derived from any of the nucleotide 
sequences SEQ ID NO: 1-84, or 168-251. Because the corresponding gene is only expressed in 

25 a limited number of tissues, a hybridization probe derived from any of the nucleotide sequences 
SEQ ID NO: 1-84, or 168-251can be used as an indicator of the presence of RNA of cell type of 
such a tissue in a sample. Preferably a hybridization probe from any of nucleotide sequences 
SEQ ID NO: 1-84, or 168-251 can be used as an indicator of bone marrow tissue. 

Any suitable hybridization technique can be employed, such as, for example, in situ 

30 hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 provides 

additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used in 
PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both. The 
probe will comprise a discrete nucleotide sequence for the detection of identical sequences or a 
degenerate pool of possible sequences for identification of closely related genomic sequences. 
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Other means for producing specific hybridization probes for nucleic acids include the 
cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such vectors 
are known in the art and are commercially available and may be used to synthesize RNA probes 
in vitro by means of the addition of the appropriate RNA polymerase as 17 or SP6 RNA 
5 polymerase and the appropriate radioactively labeled nucleotides. The nucleotide sequences may 
be used to construct hybridization probes for mapping their respective genomic sequences. The 
nucleotide sequence provided herein may be mapped to a chromosome or specific regions of a 
chromosome using well-known genetic and/or chromosomal mapping techniques. These 
techniques include in situ hybridization, linkage analysis against known chromosomal markers, 

1 0 hybridization screening with libraries or flow-sorted chromosomal preparations specific to 
known chromosomes, and the like. The technique of fluorescent in situ hybridization of 
chromosome spreads has been described, among other places, in Verma et al (1988) Human 
Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York NY. 

Fluorescent in situ hybridization of chromosomal preparations and other physical 

15 chromosome mapping techniques may be correlated with additional genetic map data. Examples 
of genetic map data can be found in the 1994 Genome Issue of Science (265: 198 If). Correlation 
between the location of a nucleic acid on a physical chromosomal map and a specific disease (or 
predisposition to a specific disease) may help delimit the region of DNA associated with that 
genetic disease. The nucleotide sequences of the subject invention may be used to detect 

20 differences in gene sequences between normal, carrier or affected individuals. 

3.17 PREPARATION OF SEQUENCING CHIPS AND ARRAYS 

A basic example is using 6-mers attached to 50 micron surfaces to give a chip with 
dimensions of 3 x 3 mm which can be combined to give an array of 20 x 20 cm. Another 
25 example is using 9-mer oligonucleotides attached to 10 x 10 microns surface to create a 9-mer 
chip, with dimensions of 5 x 5 mm. 4000 units of such chips may be used to create a 30 x 30 
array. In an array in which 4,000 to 16,000 oligochips are arranged into a square array. A plate, 
or collection of tubes, as also depicted, may be packaged with the array as part of the sequencing 
kit. 

30 The arrays may be separated physically from each other or by hydrophobic surfaces. One 

possible way to utilize the hydrophobic strip separation is to use technology such as the Iso-Grid 
Microbiology System produced by QA Laboratories, Toronto, Canada. 

Hydrophobic grid membrane filters (HGMF) have been in use in analytical food 
microbiology for about a decade where they exhibit unique attractions of extended numerical 

35 range and automated counting of colonies. One commercially available grid is ISO-GRID™ 
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from QA Laboratories Ltd. (Toronto, Canada) which consists of a square (60 x 60 cm) of 
polysulfone polymer (Gelman Tuffryn HT-450, .45 urn pore size) on which is printed a black 
hydrophobic ink grid consisting of 1600 (40 x 40) square cells. HGMF have previously been 
inoculated with bacterial suspensions by vacuum filtration and incubated on the differential or 
5 selective media of choice. 

Because the microbial growth is confined to grid cells of known position and size on the 
membrane, the HGMF functions more like an MPN apparatus than a conventional plate or 
membrane filter. Peterkin et al. (1987) reported that these HGMFs can be used to propagate and 
store genomic libraries when used with a HGMF replicator. One such instrument replicates 

10 growth from each of the 1600 cells of the ISO-GRID and enables many copies of the master 
HGMF to be made (Peterkin et al., 1 987). 

Sharpe et al. (1989) also used ISO-GRID HGMF form QA Laboratories and an 
automated HGMF counter (MI-100 Interpreter) and RP-100 Replicator. They reported a 
technique for maintaining and screening many microbial cultures. 

15 Peterkin and colleagues later described a method for screening DNA probes using the 

hydrophobic grid-membrane filter (Peterkin et al., 1989). These authors reported methods for 
effective colony hybridization directly on HGMFs. Previously, poor results had been obtained 
due to the low DNA binding capacity of the epoxysulfone polymer on which the HGMFs are 
printed. However, Peterkin et al. (1 989) reported that the binding of the DNA to the surface of 

20 the membrane was improved by treating the replicated and incubated HGMF with 

polyethyleneimine, a polycation, prior to contact with DNA. Although this early work uses 
cellular DNA attachment, and has a different objective to the present invention, the methodology 
described may be readily adapted for Format 3 SBH. 

In order to identify useful sequences rapidly, Peterkin et al. (1989) used radiolabeled 

25 plasmid DNA from various clones and tested its specificity against the DNA on the prepared 
HGMFs. In this way, DNA from recombinant plasmids was rapidly screened by colony 
hybridization against 100 organisms on HGMF replicates that can be easily and reproducibly 
prepared. 

Manipulation with small (2-3 mm) chips, and parallel execution of thousands of the 
30 reactions. The solution of the invention is to keep the chips and the probes in the corresponding 
arrays. In one example, chips containing 250,000 9-mers are synthesized on a silicon wafer in 
the form of 8 x 8 mM plates (15 uM/oligonucleotide, Pease et al., 1994) arrayed in 8 x 12 format 
(96 chips) with a 1 mM groove in between. Probes are added either by multichannel pipette or 
pin array, one probe on one chip. To score all 4000 6-mers, 42 chip arrays have to be used, 
35 either using different ones, or by reusing one set of chip arrays several times. 
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In the above case, using the earlier nomenclature of the application, F=9; P=6; and 
F+P=l 5. Chips may have probes of formula BxNn, where x is a number of specified bases B; 
and n is a number of non-specified bases, so that x= 4 to 10 and n= 1 to 4. To achieve more 
efficient hybridization, and to avoid potential influence of any support oligonucleotides, the 
5 specified bases can be surrounded by unspecified bases, thus represented by a formula such as 
(N)nBx(N)m. 

3.18 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared by, for 
example, directly synthesizing the oligonucleotide by chemical means, as is commonly practiced 

1 0 using an automated oligonucleotide synthesizer. 

Support bound oligonucleotides may be prepared by any of the methods known to those of 
skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy is to 
precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can be 
achieved using passive adsorption (Inouye & Hondo, (1990) J. Clin. Microbiol. 28(6) 1469-72); 

1 5 using UV light (Nagata et al, 1 985; Dahlen et aL, 1 987; Moirissey & Collins, (1 989) MoL Cell 
Probes 3(2) 189-207) or by covalent binding of base modified DNA (Keller et al, 1 988; 1 989); all 
references being specifically incorporated herein. 

Another strategy that may be employed is the use of the strong biotin-streptavidin 
interaction as a linker. For example, Broude et al (1 994) Proc. Natl. Acad. Sci. USA 91 (8) 3072-6, 

20 describe the use of biotinylated probes, although these are duplex probes that are immobilized on 
streptavidin-coated magnetic beads. Streptavidin-coated beads may be purchased from Dynal, 
Oslo. Of course, this same linking chemistry is applicable to coating any surface with streptavidin. 
Biotinylated probes may be purchased from various sources, such as, e.g., Operon Technologies 
(Alameda, CA). 

25 Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used. Nunc 

Laboratories have developed a method by which DNA can be covalently bound to the microwell 
surface termed Co valink NH. CovaLink NH is a polystyrene surface grafted with secondary amino 
groups (>NH) that serve as bridge-heads for further covalent coupling. CovaLink Modules may be 
purchased from Nunc Laboratories. DNA molecules may be bound to CovaLink exclusively at the 

30 5-end by a phosphoramidate bond, allowing immobilization of more than 1 pmol of DNA 
(Rasmussener al y (1991) Anal. Biochem. 198(1) 138-42). 

The use of CovaLink NH strips for covalent binding of DNA molecules at the 5-end has 
been described (Rasmussen et al., (1 991). In this technology, a phosphoramidate bond is employed 
(Chu et al, (1 983) Nucleic Acids Res. 1 1 (8) 65 1 3-29). This is beneficial as immobilization using 
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only a single covalent bond is preferred. The phosphoramidate bond joins the DNA to the 
CovaLinkNH secondary amino groups that are positioned at the end of spacer arms covalently 
grafted onto the polystyrene surface through a 2 nm long spacer arm. To link an oligonucleotide to 
CovaLinkNH via a phosphoramidate bond, the oligonucleotide terminus must have a 5-end 
5 phosphate group. It is, perhaps, even possible for biotin to be covalently bound to CovaLink and 
then streptavidin used to bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) and 
denaturing for 1 0 min. at 95°C and cooling on ice for 1 0 min. Ice-cold 0. 1 M 1 -methylimidazole, 
pH 7.0 (1-Melm 7 ), is then added to a final concentration of 10 mM 1-Melm7. A ss DNA solution is 
1 0 then dispensed into CovaLink NH strips (75 ul/well) standing on ice. 

Carbodiimide0.2 M l-ethyl-3-(3-dimethylaminopropyl)-carbodiimide(EDC), dissolved in 
1 0 mM 1 -Melm 7 , is made fresh and 25 ul added per well. The strips are incubated for 5 hours at 
50°C. After incubation the strips are washed using, e.g., Nunc-Immuno Wash; first the wells are 
washed 3 times, then they are soaked with washing solution for 5 min., and finally they are washed 
1 5 3 times (where in the washing solution is 0.4 N NaOH, 0.25% SDS heated to 50°C). 

It is contemplated that a further suitable method for use with the present invention is that 
described in PCT Patent Application WO 90/03382 (Southern& Maskos), incorporated herein by 
reference. This method of preparing an oligonucleotide bound to a support involves attaching a 
nucleoside 3-reagent through the phosphate group by a covalent phosphodiester link to aliphatic 
20 hydroxyl groups carried by the support. The oligonucleotide is then synthesized on the supported 
nucleoside and protecting groups removed from the synthetic oligonucleotide chain under standard 
conditions that do not cleave the oligonucleotide from the support. Suitable reagents include 
nucleoside phosphoramidite and nucleoside hydrogen phosphorate. 

An on-chip strategy for the preparation of DNA probe for the preparation of DNA probe 
25 arrays may be employed. For example, addressable laser-activated photodeprotectionmay be 

employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described by 
Fodorer al (1991) Science 25 1(4995) 767-73, incorporated herein by reference. Probes may also 
be immobilized on nylon supports as described by Van Ness et al (1 991 ) Nucleic Acids Res. 
19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1 988) Anal. Biochem. 
30 1 69( 1 ) 1 04-8; all references being specifically incorporated herein. 

To link an oligonucleotide to a nylon support, as described by Van Ness et al (1991), 
requires activation of the nylon surface via alkylation and selective activation of the 5-amine of 
oligonucleotides with cyanuric chloride. 

One particular way to prepare support bound oligonucleotides is to utilize the 
35 light-generated synthesis described by Pease et al, (1 994) Proc. Natl. Acad. Sci. USA 91(11) 5022- 
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6, incorporated herein by reference). These authors used current photolithographic techniques to 
generate arrays of immobilized oligonucleotide probes (DNA chips). These methods, in which light 
is used to direct the synthesis of oligonucleotide probes in high-density, miniaturized arrays, utilize 
photolabile 5-protectediV-acyl-deoxynucleosidephosphoramidites,siirface linker chemistry and 
5 versatile combinatorial synthesis strategies. A matrix of 256 spatially defined oligonucleotide 
probes may be generated in this manner. 

3.19 PREPARATION OF NUCLEIC ACID FRAGMENTS 

The nucleic acids may be obtained from any appropriate source, such as cDNAs, genomic 
DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC inserts, and RNA, 
1 0 including mRNA without any amplification steps. For example, Sambrook et al ( 1 989) describes 
three protocols for the isolation of high molecular weight DNA from mammalian cells (p. 
9.14-9.23). 

DNA fragments may be prepared as clones in Ml 3, plasmid or lambda vectors and/or 
prepared directly from genomic DNA or cDNA by PCR or other amplification methods. Samples 

1 5 may be prepared or dispensed in multiwell plates. About 1 00-1 000 ng of DNA samples may be 
prepared in 2-500 ml of final volume. 

The nucleic acids would then be fragmented by any of the methods known to those of skill 
in the art including, for example, using restriction enzymes as described at 9.24-9.28 of Sambrook et 
al (1989), shearing by ultrasound and NaOH treatment. 

20 Low pressure shearing is also appropriate, as described by Schriefer et al (1 990) Nucleic 

Acids Res. 1 8(24) 7455-6, incorporated herein by reference). In this method, DNA samples are 
passed through a small French pressure cell at a variety of low to intermediate pressures. A lever 
device allows controlled application of low to intermediate pressures to the cell. The results of 
these studies indicate that low-pressure shearing is a useful alternative to sonic and enzymatic DNA 

25 fragmentation methods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using the two 
base recognition endonuclease, Cv/JI, described by Fitzgerald et al (1 992) Nucleic Acids Res. 
20(1 4) 3753-62. These authors described an approach for the rapid fragmentation and fractionation 
of DNA into particular sizes that they contemplated to be suitable for shotgun cloning and 

30 sequencing. 

The restriction endonuclease Cv/JI normally cleaves the recognition sequence PuGCPy 
between the G and C to leave blunt ends. Atypical reaction conditions, which alter the specificity of 
this enzyme (Cv/JI**), yield a quasi-random distribution of DNA fragments form the small 
molecule pUC 1 9 (2688 base pairs). Fitzgerald et al (1 992) quantitatively evaluated the 
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randomness of this fragmentation strategy, using a CvzJI** digest of pUC 19 that was size 
fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z minus 
Ml 3 cloning vector. Sequence analysis of 76 clones showed that CvzJI** restricts pyGCPy and 
PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated at a rate 
consistent with random fragmentation. 

As reported in the literature, advantages of this approach compared to sonication and 
agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 ug instead of 2-5 
ug); and fewer steps are involved (no preligation, end repair, chemical extraction, or agarose gel 
electrophoresis and elution are needed 

Irrespective of the manner in which the nucleic acid fragments are obtained or prepared, it is 
important to denature the DNA to give single stranded pieces available for hybridization. This is 
achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is then cooled 
quickly to 2°C to prevent renaturation of the DNA fragments before they are contacted with the 
chip. Phosphate groups must also be removed from genomic DNA by methods known in the art. 

3.20 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane. 
Spotting may be performed by using arrays of metal pins (the positions of which correspond to an 
array of wells in a microtiter plate) to repeated by transfer of about 20 nl of a DNA solution to a 
nylon membrane. By offset printing, a density of dots higher than the density of the wells is 
achieved. One to 25 dots may be accommodated in 1 mm 2 , depending on the type of label used. By 
avoiding spotting in some preselected number of rows and columns, separate subsets (subarrays) 
may be formed. Samples in one subarray may be the same genomic segment of DNA (or the same 
gene) from different individuals, or may be different, overlapped genomic clones. Each of the 
subarrays may represent replica spotting of the same samples. In one example, a selected gene 
segment may be amplified from 64 patients. For each patient, the amplified gene segment may be 
in one 96-well plate (all 96 wells containing the same sample). A plate for each of the 64 patients is 
prepared. By using a 96-pin device, all samples may be spotted on one 8 x 12 cm membrane. 
Subarrays may contain 64 samples, one from each patient. Where the 96 subarrays are identical, the 
dot span may be 1 mm 2 and there may be a 1 mm space between subarrays. 

Another approach is to use membranes or plates (available from NUNC, Naperville, Illinois) 
which may be partitioned by physical spacers e.g. a plastic grid molded over the membrane, the grid 
being similar to the sort of membrane applied to the bottom of multiwell plates, or hydrophobic 
strips. A fixed physical spacer is not preferred for imaging by exposure to flat phosphor-storage 
screens or x-ray films. 
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The present invention is illustrated in the following examples. Upon consideration of the 

present disclosure, one of skill in the art will appreciate that many other embodiments and variations 

may be made in the scope of the present invention. Accordingly, it is intended that the broader 

aspects of the present invention not be limited to the disclosure of the following examples. The 

5 present invention is not to be limited in scope by the exemplified embodiments that are intended as 

illustrations of single aspects of the invention, and compositions and methods that are functionally 

equivalent are within the scope of the invention. Indeed, numerous modifications and variations in 

the practice of the invention are expected to occur to those skilled in the art upon consideration of 

the present preferred embodiments. Consequently, the only limitations that should be placed upon 

10 the scope of the invention are those which appear in the appended claims. 

All references cited within the body of the instant specification are hereby incorporated by 

reference in their entirety. 

4.0 EXAMPLES 

4.1 EXAMPLE 1 

15 Novel Nucleic Acid Sequences Obtained From Various Libraries 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared from various 
human tissues and in some cases isolated from a genomic library derived from human chromosomes 
using standard PCR, SBH sequence signature analysis and Sanger sequencing techniques. The 
inserts of the library were amplified with PCR using primers specific for the vector sequences that 

20 flank the inserts. Clones from cDNA libraries were spotted on nylon membrane filters and screened 
with oligonucleotide probes (e.g., 7-mers) to obtain signature sequences. The clones were clustered 
into groups of similar or identical sequences. Representative clones from each cluster were selected 
for sequencing. 

The sequence of the amplified inserts, in some cases, was then deduced using a typical 
25 Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye 

terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied Biosystems 
(ABI) sequencer to obtain the novel nucleic acid sequences. 

4.2 EXAMPLE 2 
Novel Nucleic Acids 

30 The novel nucleic acids of the present invention were assembled from sequences that were 

obtained from a cDNA library by methods described in Example 1 above, and in some cases 
sequences obtained from one or more public databases. The nucleic acids of SEQ ID NO: 1 -84, 
inclusive, were assembled using an EST sequence as a seed. Then a recursive algorithm was used 

100 



WO 01/74836 PCT/US01/10472 
to extend some of the seed ESTs into an extended assemblage, by pulling additional sequences from 
different databases (i.e., Hyseq's database containing EST sequences, dbEST version 1 1 9, gb pri 
1 1 9, and UniGene version 1 1 9, Geneseq October version, and Genscan, Genemark and Hyseq gene 
predictions on human genomic sequence from the human genome project updated October 2000) 
5 that belong to this assemblage. The algorithm terminated when there was no additional sequences 
from the above databases that would extend the assemblage. Inclusion of component sequences 
into the assemblage was based on a BLASTN hit to the extending assemblage with BLAST score 
greater than 300 and percent identity greater than 95%. 

10 43 EXAMPLE 3 

Further Characterization 

Clusters from Example 1 were identified which were expressed in bone marrow tissue 
cDNA libraries, but not in other tissues. Novel nucleic acids were assembled by the method of 
Example 2. A subset of the assembled nucleic acids comprising sequences from the identified 
1 5 clusters was selected. This subset includes SEQ ID NO: 1 -84. The tissue sources in which SEQ ID 
NO: 1-84 were exclusively expressed were found to be in BMD001 and BMD002 bone marrow 
libraries (Clontech). 

The homologies for SEQ ID NO: 1-84, and the corresponding peptide sequences, SEQ ID 
NO: 85-167, were obtained by performing various searches as shown in Tables 1 A to ID and as 
20 discussed herein. 

The homologous sequences to the amino acid sequences corresponding to SEQ ID NO: 
1-84 were obtained by a BLASTP version 2.0al 19MP-WashU search against the Geneseq 
database updated November 9, 2000, update 23 for year 2000 (Derwent), using the BLAST 
algorithm. The homologues for the amino acid sequences corresponding to SEQ ID NO: 1-84 
25 from Geneseq are shown in Table 1 A below. 

The homologous sequences to the amino acid sequences corresponding to SEQ ID NO: 
1-84 were also obtained by a BLASTP version 2.0al 19MP-WashU search against the NCBI 
Genbank nr database updated November 10, 2000, using the BLAST algorithm. The 
homologues for the amino acid sequences corresponding to SEQ ID NO: 1 -84 from Genbank are 
30 shown in Table IB below. 

The homologous sequences to SEQ ED NO: 1-84 were also obtained by a BLASTN 
version 2.0al 19MP-WashU search against the Geneseq database updated November 9, 2000, 
update 23 for year 2000 (Derwent), using the BLAST algorithm. The homologues for SEQ ID 
NO: 1-84 from Geneseq are shown in Table 1C below. 
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The homologous sequences to SEQ ID NO: 1-84 were also obtained by a BLASTN 
version 2.0al 19MP-WashU search against the NCBI Genbank nt database updated November 
10, 2000, using the BLAST algorithm. The homologues for SEQ ID NO: 1-84 from Genbank 
are shown in Table ID below. 

Using eMatrix software package (Stanford University, Stanford, CA) (Wu et al., J. 
Comp. Biol., Vol. 6 pp. 219-235 (1999) herein incorporated by reference), the polypeptide 
sequences corresponding to SEQ ID NO: 1-84 were examined to determine whether they had 
identifiable signature regions. Table 2 shows the signature region found in the indicated 
polypeptide sequences, the description of the signature, the eMatrix p-value(s) and the 
position(s) of the signature within the polypeptide sequence. 

Using the PFam software program (Sonnhammer et al., Nucleic Acids Res., Vol. 26(1) 
pp. 320-322 (1998) herein incorporated by reference) all the polypeptide sequences 
corresponding to SEQ ID NO: 1-84 were examined for domains with homology to certain 
peptide domains. Table 3 shows the name of the domain found, the description, the e-value and 
the PFam score for the identified domain within the sequence. 

The polypeptide sequence within each of the polypeptides corresponding to SEQ ID NO: 
1-84 that is the predicted signal peptide sequence and its cleavage site can be determined using 
Neural Network SignalP VI. 1 program (from Center for Biological Sequence Analysis, The 
Technical University of Denmark). The process for identifying prokaryotic and eukaryotic 
signal peptides and their cleavage sites are also disclosed by Henrik Nielson, Jacob Engelbrecht, 
Soren Brunak, and Gunnar von Heijne in the publication " Identification of prokaryotic and 
eukaryotic signal peptides and prediction of their cleavage sites" Protein Engineering, Vol. 10, 
no. 1, pp. 1-6 (1997), incorporated herein by reference. A mean S score, as described in the 
Nielson et. al. was obtained for the polypeptide sequences. Table 4 shows the position of the 
predicted signal peptide in each of the polypeptides corresponding to SEQ ID NO: 1-84 and the 
mean score associated with that signal peptide. 

4.4 EXAMPLE 4 

Assemblage of Novel Nucleic Acids 

The contigs or nucleic acids of the present invention, designated as SEQ ID NO: 1 68-251 
were assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend 
the seed EST into an extended assemblage, by pulling additional sequences from different databases 
(i.e., Hyseq's database containing EST sequences, dbEST version 1 1 5, gb pri 1 1 5, and UniGene 
version 1 03 and exons from public domain genomic sequences predicted by Genscan) that belong to 
this assemblage. The algorithm terminated when there was no additional sequences from the above 
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databases that would extend the assemblage. Inclusion of component sequences into the 
assemblage was based on a BLASTN hit to the extending assemblage with BLAST score greater 
than 300 and percent identity greater than 95%. 

Table 6 sets forth the novel predicted polypeptides (including proteins) encoded by the 
novel polynucleotides (SEQ ID NO: 1 68-25 1 ) of the present invention, and their corresponding 
translation start and stop nucleotide locations to each of SEQ ID NO: 1 68-25 1 . Table 6 also 
indicates the method by which the polypeptide was predicted. Method A refers to a polypeptide 
obtained by using a software program called FASTY (available from http://fasta.bioch.virginia.edu"> 
which selects a polypeptide based on a comparison of the translated novel polynucleotide to known 
polynucleotides (W.R. Pearson, Methods in Enzymology, 183:63-98 (1990), herein incorporated by 
reference). Method B refers to a polypeptide obtained by using a software program called GenScan 
for human/vertebrate sequences (available from Stanford University, Office of Technology 
Licensing) that predicts the polypeptide based on a probabilistic model of gene 
structure/compositionalproperties(C. Burge and S. Karlin, J. Mol. Biol., 268:78-94 (1997), 
incorporated herein by reference). Method C refers to a polypeptide obtained by using a Hyseq 
proprietary software program that translates the novel polynucleotide and its complementary strand 
into six possible amino acid sequences (forward and reverse frames) and chooses the polypeptide 
with the longest open reading frame. These polynucleotides and polypeptides have homology to the 
sequences selected in Example 3. 
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Table 1A 
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ID 

NO: 


Accession 
No 


Jtsiast ocore 


p-value 


% 
Identity 


Description 


1 
1 


Taenia 


1 £A1 

loyl 
(259.8bits) 


1 A« H(\ 


AO 

93 


TAOAO O TT. 1 * 

T93038 Human monoclonal 
antibody light chain GM4- 
IgG4.1ambda encoding DNA. 


a 

J 




ZOOO 

(406.1bits) 


A A^. 1 *7A 

y.Ue-170 


AO 

98 


CI 0865 Human secreted protein 
5' EST, SEQ ID NO: 14940. 
Length = 896 


A 

4 


C\&\ 1 7 A 


illy 

(173.9bits) 


o./e-4D 


97 


Q61 170 Human brain Expressed 
Sequence Tag EST01 7 15. 
Length = 305 


c 
J 


CUo j 1 4 


/CO A 

(100.6bits) 


6.3e-45 


1 A A 

100 


C06514 Human secreted protein 
5' EST, SEQ ID NO: 10589. 
Length = 741 


n 
I 




A 1 '"7^7 

2177 
(332.7bits) 


1 1 _ AA 

l.le-90 


99 


C27831 Human secreted protem 
5' EST, SEQ ID NO: 31906. 
Length = 442 


o 
O 


71 OO/CC 

Zliioj 


806 
(127.0bits) 


1.2e-30 


99 


Z13365 Human gene expression 
product cDNA sequence SEQ ID 
NO:834. Length = 300 


10 


T18679 


588 
(94.3bits) 


2.1e-20 


77 


T18679 Human lastin cDNA 
(partial sequence). Length = 
2223 


11 


Z65341 


3392 
(5 15. Obits) 


1.3e-255 


97 


Z65341 Human secreted protein 
gene 92. Length =1416 


12 


Q78896 


2269 
(346.5bits) 


1.7e-97 


99 


Q78896 VHL disease gene g7. 
Length =1816 


1 J 


zv /Uzo 


2999 
(456.0bits) 


O 1 ->A 

2.5e-130 | 


97 


Z97028 Human secreted protein 
gene 10 cDNA clone 
HDPWU34, SEQ ID NO:20. 


1 A 

14 


A4oJol 


1 a >t o 

1943 
(297.6bits) 


l.0e-82 


90 


A46361 Nucleotide sequence of 
the gene insert of CINN 1 . 
Length = 1549 


ID 


AloozJ 


(533.0bits) 


1.5e-236 


73 


A 16623 Human secreted protem 
clone pt332_l nucleotide 
sequence SEQIDNO:ll. 


1 A 
ID 


71 771 a 
Zl / / 1U 


(560.0bits) 


2.1e-161 


AO 

98 


r-T i «ni ATT- • 

Z17710 Human gene expression 
product cDNA sequence SEQ ID 
NO:5l83. Length = 758 


17 


V 04405 


^Q*27 

(806.1bits) 


A A 


AC 


V84468 Human secreted protein 
gene 58 clone HE9HU17. 
Length = 2483 


1 ft 


r*i7/i^A 

Ksl /4j0 


J4!> 
C57 Rbits^ 


1 Co> A A 


85 


CI 7456 Human secreted protein 

J i-»o 1 , OS2,\£ XU 1NU. jL 1 3 j I * 

Length = 157 


20 


V57903 


2178 
(332.8bits) 


1.0e-110 


89 


V57903 Hereditary 
haemochromatosis subregion 
from an HH affected individual. 


21 


X87150 


5147 


LOe-227 


98 


X87150 Human protease 



104 



WO 01/74836 



PCT/US01/10472 



SEQ 

ID 

NO: 


Accession 
No 


Blast Score 


p-value 


% 
Identity 


Description 






(778.3bits) 






HUPM-2 cDNA. Length = 3043 


22 


A45360 


1189 
ri84 4bitsk 


4.9e-48 


88 


A45360 Mouse secreted 
expressed sequence lag oiiv^ 
NO:1935. Length = 374 


23 


C26801 


1830 
(280 6bitsi 


5.2e-77 


96 


C26801 Human secreted protein 

J Ho 1 , OxjV^ llv IN \J. jyJo /D. 

Length = 393 


24 


Z16343 


1930 


7.9e-80 


97 


Z 16343 Human gene expression 
prouuci cltn/\ sequence o.c\£ i\j 
NO:3813. Length = 465 


25 


X29140 


6333 
(956.3bits) 


2.0e-281 


72 


X29140 Hypoxia-regulated gene 
sequence RTP220. Length = 
4121 






(925.9bits) 




00 
yy 


Z,yuo3 1 Human adipose tissue 
protein #1 encoding DNA. 
Length = 3211 


28 


Z77502 


1581 


6.7e-66 


84 


Z77502 Human ovarian tumor 
cjjin A library denved bo 1 
fragment 53. Length = 540 


30 


A58471 


377 

^OZ.ODllS^ 


2.5e-07 


55 


A58471 Nucleotide sequence of 
the bleomycin (BLM) gene 
cluster ORFs 8-30. 


32 


X98701 


3063 


5.2e-132 


95 


X98701 Human validated cancer 
ceil denved cDJNA 7? 15. Length 
= 750 


33 


V34159 


1317 


3.5e-53 


89 


V34159 Human secreted protein 
gene o clone JnLtJ ML Y y 1 . Length 
= 425 


34 


X34656 


1275 


l.le-52 


66 


X34656 Human ZIP-kinase 
(serine/threonine kinase) 
encoding DNA. Length = 2132 


36 


C08395 


580 
(93 lbits'* 


1.4e-20 


79 


C08395 Human secreted protein 

D Ho 1 , Oliv INLr. 1ZH-/U. 

Length = 406 


40 


X37471 


400 
(66. lbits) 


7 4^-10 


ion 


s±j 1 riuman secreiea protein 
cDNA fragment containing gene 
21. Length = 990 


41 


T22028 


487 
(79.1bits) 


7 3e-16 


OS 


l zzuzo Jri urn an gene signaiure 
HUMGS03571. Length = 127 


42 


Z94751 


447 
(73 lbit<T> 


2.1e-12 


57 


Z94751 Human ATP binding 
cassene /\x>v^/\o ^/\r>v^-newj 
cDNA. Length = 2911 


43 


C31590 


1353 
(209. lbits) 


1.7e-55 


98 


C3 1 590 Human secreted nrotein 
5' EST, SEQ ID NO: 35665. 
Length = 417 


44 


C02717 


1313 
(203. lbits) 


1.7e-53 


98 


C02717 Human secreted protein 
5' EST, SEQ ID NO: 2715. 
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SEQ 

ID 

NO: 


Accession 
No 


Blast Score 


p-value 


% 
Identity 


Description 












Length = 268 


45 


Z87318 


385 
(63.8bits) 


9.9e-08 


57 


Z87318 S. venezuelae pik 
(macrolide biosynthesis) gene 
cluster. Length = 36,778 


46 


V12391 


383 
(63.5bits) 


5.9e-09 


64 


V12391 Mouse osteoclast 
transporter protein encoding 
cDNA. Length = 2102 


47 


Z16086 


318 
(53.8bits) 


3.2e-06 


• 59 


Z 16086 Human gene expression 
product cDNA sequence SEQ ID 
NO:3556. Length = 754 


48 


V32401 


1467 
(226.2bits) 


1.7e-60 


99 


V32401 Homo sapiens spry3 
gene. Length = 300 


50 


i C26006 


996 
(155.5bits) 


4.6e-39 


98 


C26006 Human secreted protein 
5' EST, SEQ ID NO: 30081 . 
Length = 208 


60 


C09439 


913 
(143.0bits) 


2.4e-35 


94 


C09439 Human secreted protein 
5' EST, SEQ ID NO: 13514. 
Length = 222 


71 


C27703 


640 
(102.1 bits) 


2.7e-23 


73 


C27703 Human secreted protein 
5' EST, SEQ ID NO: 31778. 
Length = 431 


82 


C32463 


177 
(32.6bits) 


4.1e-07 


94 


C32463 Human secreted protein 
5' EST, SEQ ID NO: 36538. 
Length = 100 
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Table IB 



SEQ 
ID 
NO: 


Accession 
No 


Blast Score 


n-vnliip 

U VAIUV 


% 
Identity 


l/catr I jj iiu JJ 


1 


L29164 1 


2253 
(344.1bits) 


3 5E-95 


96 


HTTMTGT 7F Human 

immunoglobulin light chain 
variable region (lambda-IIIb 

^iiTicrrfmrA frrvm ToA/f 

ollU^iUUJpy JUL Will IgiVJ. 

rheumatoid factor 


2 


AF305057.1 


2236 
(341.5bits) 


1.4E-93 


98 


"AF305057 Homo sapiens 
RTS (RTS) gene, complete 

vUjj cUltlllaLl vciy opiiccti 


3 


X52851.1 


1092 
(169.9bits) 


6.9E-143 


82 


HSCPH70 Human cyclophilin 
gene for cyclophilin (EC 
5.2. 1.8) Length -6711 


4 


L21 936.1 


2302 


4.2E-102 


96 


"HUMSPHX Human 
succinate dehydrogenase 
flavoprotein subunit (SDH) 
mRNA, complete cds" 


5 


AL033529.2 
5 


681 
(108.2bits) 


1.6E-46 


98 


"HS2705 Human DNA 
sequence from clone RP1- 
z/KJj on cnromosome lpJ4.l- 
35.3, complete sequence 
[Homo sapiens]" 




AT 191601 1 

3 


i vy / 
(170.6bits) 




Oft 
70 


rioUJ-M jVji rluman JL/1N A 
sequence from clone RP1- 
315G1 on chromosome Xq24- 
25. Contains a PDZ (DHR, 
vjjuvjp ^ uunidiii protein 


7 


AL162331.1 


33983 
(5104.9bits) 


0.0 


98 


HS118D241 Novel human 
gene mapping to chomosome 1 
jociigui — oy+i 


8 


AK023176. 
1 


5874 
(887.4bits) 


1.8E-259 


99 


"AK023176 Homo sapiens 
cDNA FLJ1 3 1 14 fis, clone 
NT2RP3002603 T enoth = 
2730" 


9 


AB037855. 
1 


5253 
f 794 Obits) 


0.0 


93 


"AB037855 Homo sapiens 

mRNA fnrKTAA14^4 nmtf»in 

partial cds Length = 5443" 


10 


AC004890. 
2 


617 
(98.6bits) 


2E-20 


94 


"AC004890 Homo sapiens 
PAC clone RP4-800G7 from 
7q35-q36, complete sequence" 


11 


AB020653. 
1 


5341 
(807.4bits) 


0.0 


97 


"AB020653 Homo sapiens 
mRNA for KIAA0846 protein, 
complete cds Length = 4204" 


12 


AC007999. 
11 


1914 
(293.2bits) 


9.1E-144 


96 


AC007999 Homo sapiens 
3q25-26 BAC CTB-177N7 
(California Institute of 
Technology BAC Library) 
complete sequence 
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SEQ 
ID 
NO: 


Accession 
No 


Blast Score 


p-value 


% 
Identity 


Description 


13 


AB020598. 
1 


6253 
(944.3bits) 


1.7E-276 


98 


"AB020598 Homo sapiens 
mRNA for peptide transporter 
3, complete cds" 


14 


AF153607.1 


1995 
(305.4bits) 


8.2E-84 


91 


"AF1 53607 Homo sapiens 
pxl9 mRNA, complete cds 
Length = 943" 


15 


AB032996. 
1 


12529 
(1885.9bits) 


0.0 


99 


"AB032996 Homo sapiens 
mRNA for KIAA1 170 protein, 
partial cds Length = 5073" 


16 


AB033007. 
1 


12151 
(1829.2bits) 


0.0 


99 


"AB033007 Homo sapiens 
mRNA for KIAA1181 protein, 
partial cds Length = 2432" 


17 


ALII 7430.1 


5341 
(807.4bits) 


7.7E-293 


95 


HSM800939 Homo sapiens 
mRNA; cDNA 
DKFZp434D156 (from clone 
DKFZp434D156); partial cds 


18 


AL357654.9 


685 
(108.8bits) 


1.6E-23 


100 


"AL357654 Human DNA 
sequence from clone RP5- 
1025P18 on chromosome 20 
Contains ESTs, STSs and 
GSSs, complete sequence 
[Homo" 


19 


AB020677. 
2 


15274 
(2297.8bits) 


0.0 


99 


"AB020677 Homo sapiens 
mRNA for KIAA0870 protein, 
partial cds Length = 4628" 


20 


AC009505. 
3 


2855 
(434.4bits) 


8.5E-201 


89 


"AC009505 Homo sapiens 
BAC clone RP1 1.-526D2 from 
2, complete sequence" 


21 


AF206019.1 


15735 
(2366.9bits) 


0.0 


99 


"AF206019 Homo sapiens 
REV1 protein (REV1) mRNA, 
complete cds Length = 4276" 


22 


8923709 ref 


1547 
(238.2bits) 


1.7E-63 


93 


NM_017548.1 


23 


Z83840.7 


4939 
(747.1bits) 


1.2E-215 


85 


HS216E10 Human DNA 
sequence from clone CTA- 
2 1 6E1 0 on chromosome 22 
Contains the NHP2L1 gene for 
non-histone chromosome 
protein 2 


24 


AB028958. 
1 


15904 
(2392.3bits) 


0.0 


98 


"AB028958 Homo sapiens 
mRNA for KIAA1035 protein, 
partial cds Length = 5124" 


25 


AF273437.1 


16890 
(2540.2bits) 


0.0 


100 


"AF273437 Homo sapiens 
actin binding protein anillin 
mRNA, complete cds" 


26 


AB011792. 
1 


6121 
(924.4bits) 


0.0 


99 


"AB01 1792 Homo sapiens 
mRNA for extracellular matrix 
protein, complete cds" 


27 


AF1 30358.2 


2233 


1.1E-105 


97 


"AF130358 Homo sapiens 
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ID 

NO: 


No 




p-vaiue 


©/ 
/o 

Identity 


Description 






^jH 1 . 1 DllS J 






cnromosome z 1 q n .z rAL 
90B5, complete sequence" 


28 


AC009289. 
o 


2003 

^JUO.ODllSJ 


6E-108 


96 


"AC009289 Homo sapiens, 
clone KJr i i-^Kfjj, complete 
sequence Length = 146,010" 


29 


AC007386. 
3 


1768 
(271.3bits) 


1.2E-116 


94 


"AC007386 Homo sapiens 
BAC clone RP11-359K10 
irom z, compiexe sequence 


30 


AK024129. 
1 


795 
(125.3bits) 


1.5E-25 


97 


"AK024129 Homo sapiens 
cDNA FLJ14067 fis, clone 
HEMBB1001315 Length - 
4153" 




Ar , nn70QQ 
/\L/UU fyyy. 

11 


(125.3bits) 


4.yjb-l lo 


TO 

/y 


Acuu/yyy Homo sapiens 
3q25-26 BAC CTB-177N7 
(California Institute of 
Technology BAC Library) 
complete sequence 


32 


AB037825. 

1 
1 


28786 

{hoZj.1 DIXS) 


0.0 


98 


"AB037825 Homo sapiens 
mKJN A tor 1SJLAA14U4 protein, 
partial cds Length = 7204" 


33 


AB023218. 

1 
1 


12029 
tioiu.yDitsj 


0.0 


98 


"AB023218 Homo sapiens 

W T)\T A f«w VT A A 1AA1 ^4-^1 

rnKiNA tor JsJLAAluUl protem, 
complete cds Length = 4304" 


34 


AB007144. 
1 


1275 
(197.4bits) 


1.2E-51 


66 


" AB007 1 44 Homo sapiens 
mRNA for ZIP-kinase, 
complete cds Length — zl05 


35 


AC006241. 
1 


1031 
(160.7bits) 


6.9E-48 


99 


"AC006241 Homo sapiens 
chromosome 9, clone 
nKriv.zUZ_JHi_i, complete 
sequence" 


36 


AC008733. 
7 


759 
(119.9bits) 


7.7E-27 


96 


"AC008733 Homo sapiens 
chromosome 19 clone CTD- 
2525 Jl 5, complete sequence" 


37 


AF026813.1 


1627 
(250.2bits) 


2.4E-67 


92 


AF026813 Homo sapiens 
topoisomerase III gene 
promoter region Length = 


38 


AK022932. 
1 


3459 
(525.0bits) 


2.3E-191 


99 


"AK022932 Homo sapiens 
CDNAFLJ12870 fis, clone 
IN I ZivrZulf J /z / j^engui — 
2566" 


40 


AF161365.1 


2726 
(415-lbits) 


1.4E-116 


99 


"AF161365 Homo sapiens 
HSPC102 mRNA, partial cds 
Length = 547" 


41 


AL121934 1 
7 


687 
(109.1bits) 


5 5E-41 




HRRA209A? Human TVNFA 
sequence from clone RP1 1- 
209A2 on chromosome 6. 
Contains an RPL10 (60S 
ribosomal protein LI 0) 
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SEQ 
ID 
NO: 


Accession 
No 


Blast Score 


p-value 


% 
Identity 


Description 


42 


AF250238.1 


447 
(73.1bits) 


2.6E-11 


57 


"AF250238 Homo sapiens 
macrophage ABC transporter 
CABCA7)mRNA comnlete 
cds" 


43 


AB046641. 
1 


2389 
(364.5bits) 


1E-181 


97 


"AB046641 Macaca 
fascicularis brain cDNA, clone 
QccE-16161 Length = 2901" 


44 


AL3 65224 8 


2296 
(350.5bits) 


9E-107 


99 


"AL365224 Human DNA 
sequence from clone RP1 1 - 
96B21 on chromosome 6, 
complete sequence [Homo 
sapiens]" 


45 


AF1 68787 1 


12094 
(1820.6bits) 


0 0 


99 


.TiX 1 V/O / O / X7.UII1U oClUiCiJo 

vanilloid receptor gene, partial 
sequence; CARKL and CTNS 
genes, complete cds; TIP1 

crpnp "nartial Pfl*:* P^^SIV 


46 


AC000353. 
27 


821 
(129.2bits) 


1.2E-29 


77 


"AC000353 Homo sapiens 
Chromosome 1 lql3 BAC 

Plnnp 1 8T1 ^ pnmiVlptp 

V^H-JIIC lOil-7, CUlligJlClC 

sequence" 


47 


U37263.1 


522 
(84.4bits) 


2E-16 


64 


"HSU37263 Human KRAB 
zinc finger protein (ZNF177) 


48 


AJ271735.1 


7131 
(1076.0bits) 


0.0 


99 


"HSA271735 Homo sapiens 
Xq pseudoautosomal region; 

^fompnt 1 /9 T pno+h = 

240,000" 


50 


L22009.1 


576 
(92 5bits"> 


1.9E-16 


79 


"HUM49KDA Human hnRNP 

T-T TnT^"MA mmnlptp pHq 

XX llliVi'lilj WlXJLJJlvLt^ tuo 

Length = 2201" 


51 


AF249738.1 


1537 
(236.7bits) 


1.8E-63 


73 


AF249738 Mus museums 
Pb99 gene sequence Length = 
2127 


55 


AL1 38994.3 


366 
f61 0bits"> 


0.0000024 


57 


CNS01DWZ Human 

rhrnmnmmp 14 TYMA 

sequence *** IN PROGRESS 
*** BAC C-2046P20 of 
Hbrarv CalTech-D from 
chromosome 14 of Homo 


59 


AC010137. 
3 


1325 
(204.9bits) 


2.7E-112 


100 


"AC010137 Homo sapiens 
BAC clone RP1 1-169D1 from 
Y, complete sequence" 


60 


AK024343. 
1 


2098 
(320.8bits) 


7.5E-89 


90 


"AK024343 Homo <?anipn<: 
cDNA FLJ14281 fis, clone 
PLACE1 005611, weakly 
similar to Mus musculus 
mRNAformDjlO" 


62 


AL034375.2 


1140 


4E-65 


97 


HS523G1 Human DNA 
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SEQ 
ID 
NO: 


Accession 
No 


Blast Score 


p-value 


% 
Identity 


Description 




3 


(177.1bits) 






sequence from clone 523 Gl on 
chromosome 6p22.3-24.1 
Contains part of the mRNA for 
SOAl f soinoeerebellar 


65 


AC005740. 
1 


3048 
(463.4bits) 


3.1E-130 


99 


"AC005740 Homo sapiens 
chromosome 5p, BAC clone 
50g21 (LBNL HI 54), 
complete sequence" 


69 


AL135783 6 


1648 
(253.3bits) 


5 4E-67 


99 


"AT 1 ^S7itt Human DNA 
sequence from clone RP3- 
527F8 on chromosome Xq25- 
27.1, complete sequence 
[Homo sapiens]" 


70 


AL049715 2 
5 


1824 
(279.7bits) 


9 7P-R7 




rioJOHOx i x nuiiiaii jum.n/\ 
sequence from clone RP4- 
646P1 1 on chromosome 1, 

rrimnl^t^ cpnnpnpp IT-Frvmr* 
V/UHipiCLC bcqUCUCC [X1.UIJ1U 

sapiens]" 


71 


AC006157. 

2 


1801 
(276 3hitsk 


6.6E-74 


99 


"AC006157 Homo sapiens 

VI AC rlonp PP1 1 -41409^ 

from Y, complete sequence" 


75 


AC016622. 
5 


688 
H09 3bit<f» 


1.2E-23 


90 


"AC016622 Homo sapiens 

UU UIJlUoUJllC J dUUC 1 XJ 

2343F18, complete sequence" 


78 


AC010627. 
5 


3031 
(460.8bits) 


1.8E-129 


98 


"AC010627 Homo sapiens 
chromosome 5 clone CTD- 
2165H16, complete sequence" 


79 


AL 132822 1 
5 


1257 
(194.6bits) 






noj i ui / r o nuiiiaii 
sequence from clone RP5- 
101 7F8 on chromosome 20 

Contains 9TS<: fi^Ss AnH » 

CpG Island, complete" 


80 


AC012315. 
5 


1150 
(178 6bits"i 


1.7E-44 


100 


"AC012315 Homo sapiens 

rfirnmnQAmp ^ plnnp C 111— 

2122YJ, complete sequence" 


82 


AC026425. 
3 


1196 
(185.5bits) 


1.4E-46 


99 


"AC026425 Homo sapiens 
chromosome 5 clone CTD- 
2183D23, complete sequence" 


83 


AP001331.1 


2020 
(309.1bits) 


8.5E-84 


100 


"AP0Q1331 Homo sapiens 
genomic DNA, chromosome 
8q23 5 clone:KB1153C10" 
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Table 1C 



SEQ 
ID 
NO: 


Accession 
No 


Blast Score 


p-value 


% 

Tfipntifv 


Description 


1 


W34081 


374 
(136.7bits) 


1.6e-34 


85 


W34081 Human monoclonal 
antibodv 1it>ht chain fiM4- 
IgG4.1ambda. Length = 129 


3 


G03831 


104 
(41.7bits) 


6 6e-06 


39 


OO^R^I l-finnan ^PPrptpH nrntpin 

VJ \J -J \J -J I X X LlXxlCXXX ot>LltlCU UlULCUlj 

SEQ ID NO: 7912. Length = 165 


9 


G46687 


196 
(74 1 bits') 


3.8e-15 


29 


G46687 Arabidopsis thaliana 

Tvrntpin fraornpnt ^FD TD XTO* 

58762. Length = 374 


10 

X V 


R94903 


136 
(52.9bits) 


3 9e-08 


52 


PQ4QfH Human ln<2tin T pnoth = 
J\7^7vJ xxlxxllclxx labUil. -UCIigUl — 

675 


11 


Y70963 


687 
(246 9bitsl 


7.2e-70 


51 


Y70963 Human Ras signalling 

nfiTn Ax/a 1/ dccnfMatpfi nrf\fpin 
pauiway aooUl/ldlcu piULClll 

CalDAG-GEFII. Length = 797 


12 


R66286 


304 
n 12 1 hit<;1 


4.2e-27 


100 


R66286 VHL disease gene g7 

TiTTvrlnr'T T ^TirrrVi = 08/1 


13 


G16993 


169 
(64.5bits) 


2.1e-10 


31 


G16993 Arabidopsis thaliana 
protein fragment SEQ ID NO: 
i / oho. i^engui — ^to^f 


15 


Y94903 


1182 
(421.1bits) 


3.8e-120 


59 


Y94903 Human secreted protein 
clone pt332_l protein sequence 

O-CV^ 1L/ IN KJ. 1Z. 


16 


G33308 


291 


2.3e-39 


35 


G33308 Zea mays protein 
Length = 350 


17 


W88812 


143 
(55.4bits) 


3 7e-09 




wooou jruiypepuue lragineni 
encoded by gene 58. Length = 
452 


19 


R90766 


178 

I/O 

(67.7bits) 


2 3e-09 


24 


\) Q\)fr\r\ Tl l m r\~\ l T" oilnnrpoonr 

/ uu i luiiuLU oupprcbbur 
protein HTS-1. Length = 1 137 


20 


R76595 


118 

(46 6bits") 


2.2e-07 


27 


R76595 MoMLV mutated gag 

malriv rvrntpin T pn oiV\ ~ 


21 


G48221 


286 
(105.7bits) 


5.0e-38 


43 


G48221 Arabidopsis thaliana 
protein fragment SEQ ID NO: 
60872 Length =1114 

\J\J\J X-i^XllZjVXX I J 1 T 


24 


Y13055 


183 
(69 5bits") 


3.9e-14 


97 


Y13055 Human secreted protein 
encoded bv 5' EST SFO TD NO- 
69. Length -39 


25 


Y03636 


3020 
f 1068 2bits') 
6.5e-315 


6.5e-315 


71 


Y03636 Hypoxia-regulated gene 
RTP220 nrodnct T pnath = ^4 


26 


Y67598 


1051 
(375.0bits) 


2.9e-106 


99 


Y67598 Human adipose tissue 
protein #1 . Length = 699 


28 


Y76628 


143 
(55.4bits) 


4.8e-10 


56 


Y76628 Human ovarian tumor 
EST fragment encoded protein 
124. Length = 94 


33 


W75062 


359 
(131.4bits) 


6.2e-33 


93 


W75062 Human secreted protein 
encoded by gene 6 clone 
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SEQ 
ED 
NO: 


Accession 
No 


Blast Score 


p-value 


% 
Identity 


Description 












HBMCY91. Length =73 


34 


Y06921 


267 
(99.0bits) 


9.7e-23 


39 


Y06921 Human ZIP-kinase 
(serine/threonine kinase). Length 
= 454 


39 


W81727 


127 
(49.8bits) 


1.8e-07 


41 


W81727 M. tuberculosis 
immunogenic polypeptide TbH- 
30. Length =174 


43 


G02019 


128 
(50.1 bits) 


2.0e-07 


38 


G02019 Human secreted protein, 
SEQ ID NO: 6100. Length = 82 


44 


G02711 


411 

(149.7bits) 


1.9e-38 


97 


G0271 1 Human secreted protein, 
SEQ ID NO: 6792. Length = 81 


46 


W44195 


116 

(45.9bits) 


4.0e-06 


42 


W44195 Mouse osteoclast 
transporter protein. Length = 537 


48 


W48792 


643 
(231.4bits) 


5.0e-63 


46 


W48792 Homo sapiens sprouty 
2 protein. Length = 315 


49 


Y32167 


184 

(69.8bits) 


2.2e-14 


59 


Y32167 Soybean E2F protein 
fragment. Length = 80 


60 


Y91941 


145 
(56.1 bits) 


1.6e-09 


43 


Y91941 Human chaperone 
protein 2 (HCHP-2). Length = 
375 
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Table ID 



SEQ 
ID 
NO: 


Accession 
No 


Blast Score 


p-value 


% 
Identity 


Description 


1 


AAA66494. 
1 


413 
(150.4bits) 


2.0e-38 


89 


(L291 64) immunoglobulin 
light chain variable region 
[Homo sapiens] 


3 


CSRT31 


140 
(54.3bits) 


1.7e-09 


58 


CSRT31 protein P31 -rat>prf 


5 


AAF55906. 
1 


141 
(54.7bits) 


1.3e-09 


69 


(AE003735) CG6353 gene 
product [Drosophila 
melanogaster] Length =156 


7 


CAB82724. 
1 


5366 
(1894.0bits) 


0.0 


97 


(AL162331) hypothetical 
protein [Homo sapiens] Length 
= 2270 


8 


BAB14447.- 
1 


900 
(321.9bits) 


5.0e-90 


98 


(AK023 176) unnamed protein 
product [Homo sapiens] 
Length = 265 


9 


BAA92672. 
1 


1305 
(464.4bits) 


6.0e-133 


99 


(AB037855) KIAA1434 
protein [Homo sapiens] >emb 


10 


AAD45827. 
1 


132 
(51.5bits) 


1.2e-08 


80 


AC004890_4 (AC004890) 
similar to zinc finger oroteins* 
similar to BAA24380 [Homo 
sapiens] 


11 


NP 056191. 
1 


1292 
(459.9bits) 


1.4e-131 


91 


KIAA0846 orotein FHomo 

^— *• *■ -»"*- AVV* 1 Vp/ L/A V/ Will 1 XlvlJlv 

sapiens] >dbj|BAA74869. 1 1 
(AB020653) KIAA0846 
protein [Homo sapiens] 


12 


BAB14132. 
1 


759 
(272.2bits) 


4.3e-75 


97 


fAK.022613") unnamed orotein 
product [Homo sapiens] 
Length = 664 


13 


NP 057666. 
1 


957 
(341.9bits) 


4.5e-96 


100 


peptide transporter 3 [Homo 
sapiens] >dbj|BAA93432.1| 
(AB020598) peptide 
transporter 3 [Homo sapiens] 


15 


BAA86484. 
1 


2104 
(745.7bits) 


1.3e-217 


99 


(AB032996)KIAA1170 
protein [Homo sapiens] Length 
= 838 


16 


BAA86495. 
1 


1545 
(548.9bits) 


2.2e-158 


100 


(AB033007) KIAA1181 
protein [Homo sapiens] Length 
= 336 


17 


NP 056345. 
1 


143 
(55.4bits) 


4.7e-09 


39 


DKFZP4341 56 protein 


19 


BAA74893. 
2 


5332 
(1882.0bits) 


0.0 


99 


(AB020677) KIAA0870 
protein [Homo sapiens] Length 
= 1019 


20 


GAG AVIS 
N 


160 
(61.4bits) 


3.7e-ll 


42 


GAG AVISN GAG 
POLYPROTEIN 
[CONTAINS: CORE 
PROTEIN PI 5; INNER COAT 
PROTEIN PI 2; CORE SHELL 
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SEQ 
ID 
NO: 


Accession 
No 


Blast Score 


p-value 


% 
Identity 


Description 












PROTEIN P30] >pir 


21 


NP 057400. 
1 


2952 
(1044.2bits) 


1.8e-307 


100 


REV1 protein {Homo sapiens] 
>gb|AAF06731.1|AF151538 1 
(AF151538)deoxycytidyl 
transferase; 


22 


NP 060018. 
1 


385 
(140.6bits) 


1.9e-35 


100 


hvnothetical tyrotein [Homo 
sapiens] 

>gb|AAF02423.1|AF103803J 
(AF 103 803) unknown [Homo 
sapiens] 


23 


AAG00552. 
1 


164 
(62.8bits) 


4.5e-09 


25 


AF286473_1 (AF286473) 
retinitis pigmentosa GTPase 
regulator FMus musculusl 


24 


NP 056054. 
1 


1355 
(482.0bits) 


3.0e-138 


100 


KIAA1035 protein [Homo 
sapiens] >dbj|BAA91749.1| 
(AK001 544) unnamed protein 
oroduct fHomo satwensl 


25 


NP 061155. 
1 


5727 
(2021.1bits) 


0.0 


100 


anillin [Homo sapiens] 
>gb|AAF75796.1|AF273437 1 
(AF273437) actin binding 
protein anillin [Homo sapiens] 


26 


NP 001384 

1 


1051 
(375.0bits) 


4 9e-106 


99 


pxtrficpl hilar matriv "nrntpin 0 

[Homo sapiens] 
>dbj|BAA33958.I| 
CAB01 1792^ extracellular 
matrix protein [Homo 


30 


AAF48140. 

2 


168 
(64.2bits) 


4.5e-08 


26 


(AE003488) CG2779 gene 
Droduct FDrosoDhila 
melanogaster] Length =1612 


31 


B71413 


194 
(73 4bits) 


7.2e-15 


39 


B71413 hypothetical protein 
dl3525w - Arabidonsis thaliana 
>emb 


32 


BAA92102. 
1 


1661 
(589.8bits) 


l.le-170 


90 


(AK002139) unnamed protein 
product [Homo sapiens] 
Length = 893 


33 


NP 055775. 
1 


759 
(272.2bits) 


4.3e-75 


96 


KIAA1001 protein [Homo 
sapiens] >dbj|BAA76845.1| 
(AB023218) KIAA1001 
protein [Homo sapiens] 


34 


NP 001339. 
1 


267 
(99.0bits) 


1.6e-22 


39 


death-associated nrotein 
kinase 3 [Homo sapiens] 
>dbj|BAA24955.1| 
(AB007144) ZIP-kinase 
[Homo sapiens] 


38 


BAB14313. 
1 


571 
(206.1bits) 


1.6e-81 


94 


(AK022932) unnamed protein 
product [Homo sapiens] 
Length = 838 


39 


A56154 


149 

(57.5bits) 


5.9e-07 


28 


A56154 Abl substrate ena 
(enabled) - fruit fly 
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SEQ 
ID 
NO: 


Accession 
No 


Blast Score 


p-value 


% 
Identity 


Description 












(Drosophila melanogaster) >gb 


43 


BAB03558. 
1 


582 
(209.9bits) 


2 5e-56 


79 


(AB046640) hypothetical 
protein [Macaca fascicularis] 
Length = 740 


45 


BAB00640. 
1 


679 
(244.1bits) 


1.3e-66 


50 


(AB036930)hapsin [Mus 
musculus] Length = 754 


46 


CAB09724. 
1 


120 
(47.3bits) 


2.7e-06 


46 


(Z97028) renal organic anion 
transporter 
[Pseudopleuronectes 
americanus] 


47 


AAB09748 

1 


133 
(51.9bits) 


9.4e-09 


39 


fU3725n Descrintion' KRAB 
zinc finger protein; this is a 
splicing variant that contains a 
ston codon and frame shift 
between 


48 


CAB96768. 
1 


1586 
C563 4bits> 


1.0e-162 


100 


(AJ271735) sprouty 
rDro^onhila^ homoloi? 3 
[Homo sapiens] Length = 288 


49 


CAC01815. 
1 


199 
(75.1bits) 


2.5e-15 


42 


(AL391 146) E2F transcription 
factor-like protein 
[Arabidopsis thaliana] 


60 


NP 060096 
1 


145 
(56.1bits) 


2.7e-09 


43 


hvnothetical nrotein FLJ20027 

11 Y UVUlwllvUl Ulvlvlll X X/J mvU^ t 

[Homo sapiens] 
>dbj|BAA90896.1| 
(AK000034) unnamed protein 
product [Homo sapiens] 
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SEQ 

ID 

NO: 


Accession 
No: 


Description 


p-value 


Raw 
Score 


Residue 
Position 


9 


PD01922B 


PROTEIN PHOSPHODIEST 
ERASE HYDROL. 


8.714E-20 


21.83 


63-99 


13 


BL01022B 


PTR2 family proton/o 
ligopeptide symporters 
proteins 


6.016E-14 


22.19 


. 72-118 


13 


BL01022A 


PTR2 family proton/o 
ligopeptide symporters 
proteins 


9.135E-12 


11.58 


42-61 


13 


PR00490A 


SECRETIN RECEPTOR 
SIGNATURE 


6.889E-09 


4.58 


191-204 


20 


PF01140A 


Matrix protein (MA), PI 5 


2.274E-12 


11.51 


1-55 


26 


BL01208B 


VWFC domain proteins 


1.0E-13 


15.83 


.7-22 


26 


BL00422C 


Granins proteins. 


5.765E-09 


16.18 


132-160 


26 


BL00422C 


Granins proteins. 


7.706E-09 


16.18 


126-154 


28 


PR00234B 


HIV-1 MATRIX PROTEIN 
SIGNATURE 


7.25E-09 


17.94 


108-127 


30 


PD00787B 


SYNTHASE 
BIOSYNTHESIS 
SIGNATURE 


8.085E-09 


13.26 


792-806 


33 


BL00523A 


Sulfatases proteins. 


7.5E-17 


13.36 


36-53 


33 


BL00523C 


Sulfatases proteins. 


6.143E-14 


12.64 


129-140 


33 


BL00523B 


Sulfatases proteins. 


8.105E-14 


8.64 


84-96 


48 


PR00614A 


NI-FE HYDROGENASE 
SMALL SUBUNIT 
SIGNATURE 


8.373E-09 


13.66 


159-182 


49 


PD02910A 


TRANSCRIPTION 
PROTEIN FACTOR 
REGULATION A. 


7.0E-16 


15.43 


146-181 


52 


PR00209B 


ALPHA/BETA GLIADIN 
FAMILY SIGNATURE 


9.906E-09 


4.88 


124-143 


58 


BL01166F 


RNA polymerases beta chain 
protiens 


6.049E-09 


7.27 


175-186 


60 


PR00625B 


DNAJ PROTEIN FAMILY 
SIGNATURE 


1.321E-13 


13.48 


33-54 


60 


BL00636B 


Nt-dnaJ domain proteins 


8.333E-13 


15.11 


33-54 
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SEQ 


pFam model 


Accession 


Predicted 


Predicted 


pfam 


e-value 


ID 


name 


No: 


beginning 


end of 


Score 




NO: 






of 


domain 












domain 








1 


ig 


PF01812 


9 


65 


16 


0.0022 


3 


pro isomerase 


PF00160 


62 


78 


8.6 


0.22 


10 


KRAB 


PF01352 


4 


34 


-12.7 


0.29 


11 


RasGEF 


PF00617 


146 


308 


-13 


1.10E-05 


12 


VHL 


PF01847 


159 


217 


149.4 


6.50E-41 


13 


PTR2 


PF00854 


101 


185 


9.2 


0.06 


19 


DENN 


PF02141 


117 


223 


84.8 


1.60E-22 


19 


TPR 


PF01365 


489 


522 


10.5 


3.6 


19 


WD40 


PF00400 


867 


904 


12.3 


1.3 


19 


WD40 


PF00400 


908 


949 


14 


0.8 


19 


WD40 


PF00400 


1089 


1128 


6.1 


8.7 


20 


gag_MA 


PF01140 


2 


68 


42.2 


3.30E-11 


21 


BRCT 


PF00533 


44 


131 


57.3 


3.30E-13 


21 


ODC_AZ 


PF02100 


333 


355 


2.2 


8.2 


25 


j PH 


PF01636 


985 


1108 


60.2 


2.00E-15 


48 


metalthio 


PF00131 


121 


191 


-10.5 


6.9 


51 


hormone 


PF00103 


16 


25 


1.1 


4.7 


60 


DnaJ 


PF00226 


23 


56 


-0.8 


0.032 
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TABLE 4 



SEQID 
NO: 


Signal Denticles Dosition 


Mean Score 


Cutoff 


Conclusion 


1 


1-55 


0.108 


0.48 


NO 


2 


1-9 


0.116 


0.48 


NO 


3 


1-10 


0.166 


0.48 


NO 


4 


1-143 


0.16 


0.48 


NO 


5 


1-16 


0.135 


0.48 


NO 


6 


1-8 


0.104 


0.48 


NO 


7 


1-746 


0.077 


0.48 


NO 


8 


1-31 


0.63 


0.48 


YES 


9 


1-221 


0.062 


0.48 


NO 


10 


1-22 


0.075 


0.48 


NO 


11 


1-299 


0.117 


0.48 


NO 


12 


1-89 


0.252 


0-48 


NO 


13 


1-177 


0.462 


0.48 


NO 


14 


1-73 


0.458 


0.48 


NO 


15 


1-268 


0.186 


0.48 


NO 


16 


1-46 


0.447 


0.48 


NO 


17 


1-130 


0.119 


0.48 


NO 


18 


1-36 


0.278 


0.48 


NO 


19 


1-508 


0.157 


0.48 


NO 


20 


1-6 


0.293 


0.48 


NO 


21 


1-258 


0.052 


0 48 


NO 


22 


1-37 


0.095 


0.48 


NO 


23 


1-71 


0.036 


0.48 


NO 


24 


1-96 


0.115 


0.48 


NO 


25 


1-623 


0.051 


0.48 


NO 


26 


1-38 


0.125 


0.48 


NO 


27 


1-63 


0.068 


0.48 


NO 


28 


1-14 


0.221 


0.48 


NO 


29 


1-42 


0.402 


0.48 


NO 


30 


1-724 


0.096 


0.48 


NO 


• 31 


1-150 


0.083 


0.48 


NO 


32 


1-284 


0.103 


0.48 


NO 


33 


1-16 


0.895 


0.48 


YES 


34 


1-136 


0.062 


0.48 


NO 


35 


1-43 


0.233 


0.48 


NO 


36 


1-146 


0.073 


0.48 


NO 


37 


1-66 


0.072 


0.48 


NO 


38 


1-150 


0.253 


0.48 


NO 


39 


1-115 


0.077 


0.48 


NO 


40 


1-79 


0.137 


0.48 


NO 


41 


1-16 


0.914 


0.48 


YES 


42 


1-83 


0.211 


0.48 


NO 


43 


1-132 


0.198 


0.48 


NO 


44 


1-94 


0.059 


0.48 


NO 


45 


1-463 


0.093 


0.48 


NO 


46 


1-61 


0.168 


0.48 


NO 


47 


1-131 


0.094 


0.48 


NO 
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SEQID 
NO: 


Signal peptides position 


Mean Score 


Cutoff 


Conclusion 


48 


1-237 


0.156 


0.48 


NO 


49 


1-77 


0.053 


0.48 


NO 


50 


1-8 


0.274 


0.48 


NO 


51 


1-126 


0.085 


0.48 


NO 


52 


1-37 


0.61 


0.48 


YES 


53 


1-24 


0.551 


0.48 


YES 


54 


1-24 


0.548 


0.48 


YES 


55 


1-77 


0.182 


0.48 


NO 


56 


1-25 


0.55 


0.48 


YES 


57 


1-127 


0.086 


0.48 


NO 


58 


1-89 


0.152 


0.48 


NO 


59 


1-15 


0.702 


0.48 


YES 


60 


1-41 


0.045 


0.48 


NO 


61 


1-61 


0.071 


0.48 


NO 


62 


1-104 


0.163 


0.48 


NO 


63 


1-34 


0.622 


0.48 


YES 


64 


1-36 


0.685 


0.48 


YES 


65 


1-18 


0.226 


0.48 


NO 


66 


1-51 


0.453 


0.48 


NO 


67 


1-51 


0.067 


0.48 


NO 


68 


1-11 


0.064 


0.48 


NO 


69 


1-0 


0 


0.48 


NO 


70 


1-23 


0.347 


0.48 


NO 


71 


1-43 


0.233 


0.48 


NO 


72 


1-20 


0.864 


0.48 


YES 


73 


1-13 


0.428 


0.48 


NO 


74 


1-55 


0.346 


0.48 


NO 


75 


1-13 


0.108 


0.48 


NO 


76 


1-24 


0.679 


0.48 


YES 


77 


1-61 


0.107 


0.48 


NO 


78 


1-24 


0.375 


0.48 


NO 


79 


1-69 


0.293 


0.48 


NO 


80 


1-22 


0.218 


0.48 


NO 


81 


1-14 


0.088 


0.48 


NO 


82 


1-18 


0.192 


0.48 


NO 


83 


1-0 


0 


0.48 


NO 
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dJLy 1 If 

NO: of 


NO: of 


OjL JLLr 

No: of 


No: of 


OXjvJ 111 XNU. 

in USSN 

MAM \J 1 


of mir.lpntf rip. 


oJkVJ JUL* INU: 
ci\f npntirip in 


nucleo- 
tide 


peptide 

Mr Mr 

sequence 


contig 
nucleo- 


contig 
peptide 


09/540,217 


in USSN 
60/250, 583 


USSN 
60/250, 583 


sequence 




tide 
sequence 


sequence 








1 


85 


168 


252 


22974 


2565 


2 


2 


86 


169 


253 


20878 


2577 


14 


3 


87 


170 


254 


12791 


2580 


17 


4 


88 


171 


255 


6072 


2588 


25 


5 


89 


172 


256 


16749 


2611 


48 


6 


90 


173 


257 


27958 


2612 


49 


7 


91 


174 


258 


25431 


2613 


50 


8 


92 


175 


259 


53 


2623 


60 


9 


93 


176 


260 


14203 


2632 


69 


10 


94 


177 


261 


25455 


2655 


92 


11 


95 


178 


262 


20399 


2663 


100 


12 


96 


179 


263 


18639 


2675 


112 


13 


97 


180 


264 


30435 


2676 


113 


14 


98 


181 


265 


9819 


2682 


119 


15 


99 


182 


266 


23487 


2694 


131 


16 


100 


183 


267 


27666 


2701 


138 


17 


101 


184 


268 


21075 


2703 


140 


18 


102 


185 


269 


5372 


2709 


146 


19 


103 


186 


270 


26608 


2712 


149 


20 


104 


187 


271 


7050 


2714 


151 


21 


105 


188 


272 


10656 


2718 


155 


22 


106 


189 


273 


8306 


2724 


161 


23 


107 


190 


274 


26811 


2733 


170 


24 


108 


191 


275 


19370 


2748 


185 


25 


109 


192 


276 


8838 


2753 


190 


26 


110 


193 


277 


2975 


2776 


213 


27 


111 


194 


278 


28343 


2778 


215 


28 


112 


195 


279 


23107 


2780 


217 


29 


113 


196 


280 


947 


2786 


223 


30 


114 


197 


281 


25499 


2787 


224 


31 


115 


198 


282 


26874 


2789 


226 


32 


116 


199 


283 


7863 


2791 


228 


33 


117 


200 


284 


12385 


2808 


245 


34 


118 


201 


285 


9325 


2811 


248 


35 


119 


202 


286 


135 


2830 


267 


36 


120 


203 


287 


948 


2842 


278 


37 


121 


204 


288 


11131 


2844 


280 


38 


122 


205 


289 


26590 


2848 


284 


39 


123 


206 


290 


29769 


2852 


288 


40 


124 


207 


291 


12703 


2884 


320 


41 


125 


208 


292 


19931 


2888 


324 


42 


126 


209 


293 


12950 


2892 


328 


43 


127 


210 


294 


16635 


2893 


329 
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SEQ ED 
NO: of 


SEQBD 
NO: of 


SEQED 
No: of 


SEQED 
No: of 


SEQ ED NO: 
in USSN 


SEQ ID NO: 
of nucleotide 


SEQ ID NO: 
of peptide in 


* 

nucleo- 
title 


peptide 
sequence 


contig 
nucleo- 


contig 
pepijue 


09/540,217 


in USS1N 

<Cfi/icft esrt 
OU/XDO* Doj 


1TCDXT 

USSN 






tide 
sequence 










44 


128 


211 


295 


11259 


2894 


330 


45 


129 


212 


296 


17066 


2910 


346 


46 


130 


213 


297 


27046 


2917 


353 


47 


131 


214 


298 


28443 


2920 


356 


48 


132 


215 


299 


12951 


2934 


370 


49 


133 


216 


300 


16401 


2937 


373 


50 


134 


217 [ 


301 


2095 


2941 


377 


51 


135 


218 


302 


14896 


2958 


394 


52 


136 


219 


303 


3942 


2969 


405 


53 


137 


220 


304 


18627 


2977 


413 


54 


138 


221 


305 


1334 


2978 


414 


55 


139 


222 


306 


12612 


2986 


422 


56 


140 


223 


307 


26757 


2992 


428 


57 


141 


224 


308 


29643 


3010 


446 


58 


142 


225 


309 


17502 


3024 


460 


59 


143 


226 


310 


15745 


3033 


469 


60 


144 


227 


311 


16448 


3085 


521 


61 


145 


228 


312 


2700 


3097 


533 


62 


146 


229 


313 


3038 


3124 


560 


63 


147 


230 


314 


5507 


3146 


582 


64 


148 


231 


315 


21158 


3203 


639 


65 


149 


232 


316 


30322 


3267 


703 


66 


150 


233 


317 


29542 


3292 


728 


67 


151 


234 


318 


! 19566 


3358 


794 


68 


152 


235 


319 


180 


3399 


835 


69 


153 


236 


320 


2278 


3404 


840 


70 


154 


237 


321 


6132 


3542 


978 


71 


155 


| 238 


322 


138 


3645 


1081 


72 


156 


239 


323 


29552 


3652 


1088 


73 


157 


240 


324 


28639 


3658 


1094 


74 


158 


241 


325 


186 


3732 


1168 




159 


242 


326 


7065 


3765 


1201 


76 


160 


243 


327 


18073 


3901 


1337 


77 


161 


244 


328 


513 


3985 


1421 


78 


162 


245 


329 


6994 


4036 


1472 


79 


163 


246 


330 


189 


4080 


1515 


80 


164 


247 


331 


191 


4260 


1695 


81 


165 


248 


332 


8767 


4798 


2226 


82 


166 


249 


333 


23899 


4997 


2425 


83 


167 


250 


334 


533 


5057 


2485 


84 




251 


335 


6903 


2835 
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SEQ 


M 


Predicted 


Predicted 


Amino acid segment containing signal 


ID 


e 


beginning 


end 


peptide (A=Alanine 0=Cysteine, 


NO: 


t 


nucleotide 


nucleotide 


D=Aspartic Acid, E=Glutamic Acid, 




h 


location 


location 


F=Phenylalanine, G=Glycine, 




0 


corres- 


corres- 


H=Histidine, I=Isoleucine, K=Lysine, 




d 


ponding to 


ponding to 


L=Leucine, M=Methionine, 






first amino 


first amino 


N=Asparagine, P=Proline, Q=GIutamine, 






acid 


acid residue 


R=Arginine, S=Serine, T=Threonine, 






residue of 


of amino acid 


V=Valine, W=Tryptophan, Y=Tyrosine, 






amino acid 


sequence 


X=Unknown, *=Stop cod on, /=possib!e 






sequence 




nucleotide deletion, \=possib!e nucleotide 
insertion 


252 


A 


3 


744 


RQSSGNLTMAWTPLLLPLLTFCTVSEA 

SYELTQPPSVSVSPGQTATITCSGDALP 

KKVHPYWYQQKSGQAPVLVIYEDNKR 

PSGIP\ERFSASSSGTMATLTISGAQVED 

EADYYCYSTDSSGNHRGVFGGGTRLT 

VLSQPKAAPSVTLFPPSSEELQANKAT 

LVCLISDFYPGAVTVAWKADSSPVKA 

GVETTTPGKQSNNKYAASSYLSXLTPE 

QWKSHKSYSCQVTHEGXSTVEETGAPT 

EYLLRVY 


253 


B 


1 


1617 


MEKGSGFIKYSTYKQGTIRVAEEAETA 

HSSVLIGPEKGWHLATAAVLNAVWD 

LWAKQEGKVLAVGRELQEEEKEETG 

WRKAQAAVEGGVGTWWLTASIRAAN 

AFTVRKKWGLYTYVLQILSFLLQACLE 

VTCGHDLIMGCEKSKENKSPAIKYRPE 

NTPEPVSTSVSHYGAEPTTVSPCPSSSA 

KGTAVNFSSLSMTPFGGSSGVTPFGGA 

SSSFSWPSSYPAGLTGGVTIFVALYDY 

EARTTEDLSFKKGERFQIINNTPMVLN 

LGQNHPGDIWQYLETFLVVTVGVLPLS 

SSASTPVFDRVTNGVTPTIKDLTGCCV 

ENRLLTSNSSDFFTLINHSNSSKTPFQN 

TRLWSRGNSSEKQFAIRFQDGKTDHA 

IQLSSGKKTALGREALEHPESLDSRKV 

GQRSRWSSQAASPISGPIQAETALLCPG 

DQWTQEFHTSPLLTVPHLPDIYTLDCC 

RKDFSIYIHSFGDITQSYIFKYHLQIDDY 

QLCAQALKDGWTRPPPFHTAHLHFSL 

LTLACAETVTSAETPDALAKSRFKVK 


254 


A 


1 


717 


GTRDATAEENRVLLAMVNPTVFFDIA 

VDGEPLGRVSFEVRGLDTKK*LLI*SIK 

LC*QIGGSSIFITSD*KNSCLPLF/QQCL 

LFLRILP\LFADKVPKTAENFRALSTGE 

KGFGL* GVPCFHRIIPGFMCQGGDCE/R 

HHNGTGGKSIYTEKFEDEVNFILKAYG 

VLGSLSMANA\GPNTN\GSQFFICTAKT 

EWLXDGKPWVFGKVKEGMNIVEAME 
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SEQ 

ID 

NO: 


M 

e 

t 

h 

o 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 

<IHU ICSlUUv 

nf amino aril) 

sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, £=GIutamic Acid, 
F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 

P— A r* or i rii tip ^=Spnnp TT— Tlir^Anmo 

xy—x\j guiiut;, ij^jci me, x jl iiiconincj 
V=Valiri<k W=Trr*vritfir\Ti sin Y^TTvi^AciTnk 

T T allJIC) TT Jl I J JSlUfslKtll, x Jl y » value, 

X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 










J. VI VJOXviN vJIV 1 OJTSJrvJLlOlxTJL/ V/ vJ\^l_»JZi 


255 


A 


253 


571 


ICHQPTSLSHQ*ERNGWPGRWHSQPRS 
KFCRLRV\LSQGDHEKKSLPSGQRPHR 
TGCASSSGSSKGLLLLPLDGLGVTVLIN 
PHLWFPGMRAP*LLPR1WLRRWS 


256 


A 


i 
i 


J OH 


VPHVMT TPT701^ A TV A If VPPVNTP T<T VP V 

LDHTADV/QWIVLHRA*IYFFRLHAWG 
DTLEEAFEQCAMAMFGYMTDTGTVEP 
LQTVEVETQ/GWGEEFSLSKHPQGTEV 
KAITYSAMQVYNEENPEVFVIIDI 


AJ 1 


A 


fsl S 
KJ 1 J 




V 1 oo L>JrxvlvJPsJtvKr 

LAVKAPREGRRGPGNESRSEPSQPLDS 
HGPGLRRTFLPPSPRHPTKDRRTAARS 
GPRRKRGQTNEIRGCKEEEGEKYLVPA 
QG 


258 


A 


5751 


6430 


FYFVPSQESVPSASPTGIPKHSLRKTTS 

TEEPRGTHSQGQFTMPLAGMSLGSLKS 

EFVPLFSATPFWVPFSSLPLFPWVLVED 

HVCLLDCVWDLQD\MD\IFAAERHP\R 
tyvqia APPnQQfvnT TPPQVP\/p\r^ r mnQT 

LMEPCRLKLQVERNLDKEISHTVPDISI 
HGNLSSVHCSLDLYKYKLIRGLLENNL 
GEPIEEFMRPYDLQRSKNSYCPEWRSV 
HLYVLPH 


259 


B 


144 


638 


MIVNLFNMFITYGDTFLPTPSSYDELY 
YEIIRMHQSFDNLYSMVLRLSTNAGQ 

YAAVNHISQLSEEQVLEVVRANYDTL 
TLKLQDGLDQYERYSEQHKEAAFFKE 
LVRSI STNVRRNLAFHTLSQEVLLKEFS 
TIS 


260 


B 


30 


2477 


MTPGQLSNVRAPGSAEKGSGDTGDAR 

PPSAAPPGGSAGEARTAGARYLCPRSS 

LSGGAAATRTCGLANPEEEGPSAKCGE 

NGSAERTDLGGNKYNQERIQIEYVEVL 

FADFFREVFAICGSCDALGNWNPONA 

VALLPENDTGESMLWKATIVLSRGVS 

VQYRYFKGYFLEPKENIHHRGDFLVTF 

PSSSRSSFVQTGQFSGRDIDKDPKLSPV 

GRGWGFEWAIELCMAVKEDVRQEVG 
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SEQ 
ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 

flpirl t~p^irtiip 

aVJU J vdJUUC 

of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
x\ /\j gjjijijc, cy^-oeriiie, i — j.ni£onme 9 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop cod on, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 










SfflGLLPDVAMAFVNCRGTDGSVAVR 

MTRGHSHCHLGFAYCASGFSLEPCVE 

NDCGASSAEVQQGFVFITSASSSSSYCT 

EAKRVKLTLEGLEEDDDDRVSPTVLH 

KMSNSLEISLISDNEFKCRHSQPECGYG 

LQPDRWTEYSIQTMEPDNLELIFDFFEE 

DLSEHVVQGDALPGHYGTACLLSSTIA 

ESGKSAGILTLPIMSRNSRKTIGKVRVD 

YIUKPLPGYSCDMKSSFSKYWKPRIPL 

DVGHRGAGNSTTTAQLAKVQENTIAS 

LRNAASHGAAFVEFDVHLSKDFVPVV 

YHDLTCCLTMKKKFDADPVELFEIPVK 

ELTFDQLQLLKLTHVTALKSKDRKESV 

VQEENSFSENQPFPSLKMDGMWDGNL 

STYFDMNLFLDIILKTVLENSGKRRIVF 

SSFDADICTMVRQKQNKYPILFLTQGK 

SEIYPELMDLRSRTTPIAMSFAQFENLL 

^TKrV/HTPT^T T DXTDCVTrMU A V A V \nT!f 

<jJUN vri l HULL-KINrb i lt^JbAJsj\KUL Vlr (_, 

WGDDTNDPENRRKLKELGVNGLIYDR 

IYDWMPEQPMFQVEQLERLKQELPEL 

KSCLCPTVSRFVPSSLCGESDIHVDAN 

GIDNVENA 


261 


A 


1 


3257 


MEPIEGKRSSCHKTGEATAWHCPPG 

WNITMGVEASCAFVGRAGSQDTVRTG 

RALKALTQLRAAQGRGSQGAAAAETG 

LGGRRLRRAPGGGPCVGPRAAAATTL 

SGPRGTAQGHGGGGRSSGKGDQRAHE 

LAAWIPRATRARHTGAAGAEPYYRA 

WGSGEQGRGVCRGLLRLPAGPPTPGR 

ARALAERLSPPRAAPRQDSWPLRGFLP 

PPQPLNPTSASPHPRLFSLLGARPISPW 

TMAATIQAMERKIESQAAHLLSLEGQT 

GMAEKKLADCEKTAVEFGNQLEGKW 

AVLGTLLQEYGLLQRRLENVENLLHN 

RNFWILRLPPGSKGESPKVALGRPGVG 

EAAAKPVSVWFSEQVWGKLEDWQKE 

LCKHVMRGNCEMLVSLDYAISKSEVL 

SQIEQGKEPCNWRRPGPKIPDVPVDPSP 

APVPLPLFCSLYPPGEIHQCSVPAAKQL 

HVVQRTSPVTAKLSTLQPKPHFHLVLH 
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SEQ 

ID 

NO: 


M 

e 

t 

h 

o 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 

uUU 1 VJlUUv 

of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, 
H=Histidine, I=IsoIeucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 

R = A fori n in p S=Spt"iTip T , =='TVii"pftnin a 

V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 










PTPCQLLKGNTYNPTLTSTPTATACFS 

APLRGRAPWIY1MEGNRLNQCFQTGC 

WRAPGfflQAGEEAPGSRVVFTPJTGSG 

ECRRGPEKSCGFGHSREALGEEWMIR 

KVKVEDEDQEAEEEVEWPQHLSLLP\S 

PFPAPDLGHLAAAYKLEPGAPGALSGL 

ALSGWGPMPEKPYGCGECERPvFRDQL 

TLRLHQRLHRGEGPCACPDCGRSFTQR 

AHMLLHQRSHRGERPFPCSECDKRFSK 

KAHLTRHLRTHTGERPYPCAECGKRFS 

QKIHLGSHQKTHTGERPFPCTECEKRF 

RKKTHLIRHQRIHTGERPYQCAQCARS 

FTHKQHLVRHQRVHQTAGPARPSPDS 

SASPHSTAPSPTPSFPGPKPFACSDCGL 

SFGWKKNLATHQCLHRSNEGRPFGCDE 

CALGATVDAPAAKPLASAPGGPGCGP 

GSDPVVPQRAPSGERSFFCPDCGRGFS 

HGQHLARHPRVHTGERPFACTQCDRR 

Ffi^PPTMT VAW9P. AW^frAJ?PF\Ar , Anr , n 

RRFSRKSHLG\RHQAVHTGSRPHACAV 
CARSSFSSKTNLVRHQGINHTGSRPFSC 
PQCGKSFSRKTHLVRHQLIHGEAAHA 
A\PDAALAAPAWSAPPEVAPPP\LFF 


262 


A 


327 


2561 


ITMGSSGLGKAATLDELLCTCIEMFDD 

NGELDNSYLPRIVLLMHRWYLSSTELA 

EKLLCMYRNATGESCNEFRLKICYFM 

RYWILKFPAEFNLDLGLIRMTEEFREV 

ASQLGYEKHVSLIDISSIPSYDWMRRV 

TQRKKVSKKGKACLLFDHLEPIELAEH 

LTFLEHKSFRRISFTDYQSYVIHGCLEN 

NPTLERSIALFNGISKWVQLMVLSKPT 

PQQRAEVITKFINVAKKLLQLKNFNNL 

IATV GAL\SHRSISGFKGTHS\HLSSEV\T 

KNWNVK*QKWVSSNG\NYCNYRKPFA 

DCDGFKIPILGVHLKDLIAVHVIFPDWT 

EE>0CVNI\VKMHOLSVTLSELVSLONA 

SHHLEPNMDLINLLTLSLDLYHTEDDI 

YKLSLVLEPRNSKSQPTSPTTPNKPWP 

LEWALGVMPKPDPTVINKHIRKLVESV 

FRNYDHDHDGY1SQEDFESIAANFPFL 
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SEQ 
ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 

of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, 
HNHistidine, I=IsoIeucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=GIutamine, 
x\. — /\rgLuiiJc ? — oerine, i — inreomne, 
V= Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possibIe nucleotide 
insertion 










DSFC VLDKDQDGLI SKDEMM A YFLRA 

KSQLHCQI/GAPGFIHNFQEMT\YLKP\T 

FCEHCAGFIL\GIIKQGYKCKDCGANCH 

KQCKDLLVLACRRMFARAPSLSSGHGS 

LPGSPSLPPAQ\D*VFKFPGVTADNSD\L 

DSRAITLVTGSSRK\TSVRLQRATTSQA 

1 VCJr V WoliAu WUvuouon 1 Lr i INK Y 

SGSLHKP/AKRHKGFAIWEK*KSPGWH 
\AGGDV*NPGT\EFE\LAPDEGEKTT\QD 

KJ \±j U vJJLf 1 o i\JL./\i-»IN L,JSJ\V4 In VJT W L>L^ vj \3 vJIv 

NKKLLRKALASQEVILERTP 


263 


A 


4463 


4703 


RPKMGRRSKHKPPASFQVSSLSNPGFF 
FFI*HCFF*LYFSYKRNVSL*KITHYRKI 
LRRKTFTSETKFFPMKTEPKRVSG 


264 


A 


1 


1941 


MPAPRAREQPRVPGERQPLLPRGARGP 

RRWRRAAGAAVLLVEMLERAAFFGV 

TANLVLYLNSTNFNWTGEQATRAALV 

FLGASYLLAPVGGWLADVYLGRYRA 

VALSLLLYLAASGLLPATAFPDGRSSF 

CGEMPASPLGPACPSAGCPRSSPSPYC 

APVLYAGLLLLGLAASSVRSNLTSFGA 

DXQVMDLGRDATRRFFNWFYWSINLG 

AVLSLLVVAFIQQNISFLLGYSIPVGCV 

GLAFFIFLFATPVFITKPPMGSQVSSML 

KLALQNCCPQLWQRHSARSKLSQGQQ 

GNNGSESKLHLLVAKWQHTLGRVELT 

VAVFGDDYTNIVPFGISKDSARLLDKK 

RDRQCARVLADERSPQPGASPQEDIAN 

FQVLVKILPVMVTLVPYWMVYFQMQ 

STYVLQGLHLHIPNIFPANPANISVALR 

AQGSSYTIPEAWLLLANWWLILVPL 

KDRLIDPLLLRCKLLPSALQKMALGMF 

FGFTSVIVAGVLEMERLHYIHHNETVS 

QQIGEVLYNAAPLSIWWQIPQYLLIGIS 

Jcrir/voijrvjJLuir/v i scat ivoJviv^ijr/ViMVjur 

FNCLSGVGSLLGS SL VGTAVPLPGGWL 

HCPKDFGNINNCRMDLYFFLLAGIQAV 

TALLFVWMAGRYERASQGPASHSRFSR 

DRG 


265 


B 


46 


774 


XACSGVPGTKCSPPSGSGYPNPYSKHV 
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SEQ 

ID 

NO: 


M 

e 

t 

h 

o 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=PhenylaIanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline> Q=Glutamine, 
R=Arrinine. S=Serine. T=Threnninp 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 










LTEDIVHREVTPDQKLLSRATLTKTNR 

NAHAGPERLFPANVAHSVYVLEDSIVD 

PQNQTLTTFNWNINPRPG 


266 


A 


3 


2523 


SSLTSSMEDPAAPGTGGPPANGNGNXG 

GGKGKQAAPKGREAFRSQRRESEGSV 

DCPTLEFEYGDADGHAAELSELYSYTE 

NLEFTNNRRCFEEDFKTQVQGKEWLE 

LEEDAQKAYIMGLLDRLEWSRERRL 

KAARAVLYLAQGTFGECDSEVDVLH 

WSRYNCFLLYQMGTFSTFLELLHMEID 

NSQACSSALRKPAVSIADSTELRVLLS 

VMYLMVENIRLERETDPCGWRTARET 

FRTELSFSMHNEEPFALLLFSMVTKFCS 

GLAPHFPIKKVLLLLWKWMFTLGGFE 

HLQTLKVQKRAELGLPPLAEDSIQWK 

SMRAASPPSYTLDLGESQLAPPPSKLR 

GRRGSRRQLLTKQDSLDIYNERDLFKT 

EEPATEEEEESAGDGERTLDGELDLLE 

QDPLVPPPPSQAPLSAERVAFPKGLPW 

APKVRQKDIEHFLEMSRNKFIGFTLGQ 

DTDTLVGLPRPIHESVKTLKQHKYISIA 

DVQIKNEEELEKCPMSLGEEVVPETPC 

EILYQGMLYSLPQYMIALLKILLAAAP 

TSKAKTDSINILADVLPEEMPITVLQSM 

KLGIDVNRHKJEIIVKSISTLLLLLLKHF 

KLNHIYQFEYVSQHLVFANCIPLILKFF 

NQNILSYITAKNSISVLDYPCCTIQDLPE 

LTTESLEAGDNSQFCWRNLFSCINLLR 

LLNKLTKWKHSRTMMLVVFKSAPILK 

RALKVKQAMLQLYVLKLLKLQTKYL 

GROWRKSNMKTMSAIYOKVRHRMND 

DWAYGNDIDARPWDFQAEECTLRANI 

EAFNSRRYDRPQDSEFSPVDNCLQSVL 

GQRLDLPEDFHYSYELWLEREVFSQPI 

CWEELLQNH 


267 


A 


150 


1076 


PAWNARPRQVDLKLTHKKQRALLERF 

D1YRKVPKDLTQPTYTGAIISICCCLFIL 

FLFLSELTGFITTEWNELYVDDPDKDS 

GGKIDVSLNISLPNLHCELVGLDIQDE 

MGRHEVGHIDNSMKIPLNNGAGCRFE 
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SEQ 

ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=AIanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, KHLysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q-Glutamine, 
R=Arffinine» S=Serine* T=Threoninp 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibie nucleotide 
insertion 










GQFSINKVPGNFHVSTHSATAQPQNPD 
MTfWIHKLSFGDTLQVQNIHGAFNAL 
GGADRLTSNPLASHDYTT KTVPTVYFD 

KSGKQRYSYQYTVANKEYVAYSHTG 
R1IPAIWFRYDLSPITVKYTERRQPLYRF 
ITTICAIIGGTFTVAGILDSCIFTASEAW 
KKIQLGKMH 


268 


B 


264 


1999 


MLIYSSKTLELRETSVTPSNLWGGQGL 
LGVSIRFCSFDGANENVWHVLEVESNS 
PAALAGT RPHSDYirGADTVMNFSFDT 

FSLIETHEAKPLKLYVYNTDTVYTGNS 
TWKTCVKSSYSGALVNLNRLFSSAYT 
RIPCFGALRINSDKHFVNGCWLLSTYT 
L 


269 


A 


67 


906 


NLLLGGGGKKKKPPRTRGPFPGLSQPG 

LLWLFPKRPGCSHLPSTPIKEMGLPKIH 

HRVGWESFSGVFLEVDFKIYKKKMNE 

FFSVDDNNEEEEDVEMKEDSDENGPE 

EKQSVEEMEEQSQDADGVNTVTVPGP 

ASEEAVEDCKDEDFAKDENITKGGEV 

TDHSVRDODHPDGOFNDSTKNFTKTFT 

ESQSSYMETEELSSNQEDAVIVEQPEVI 
PLTEDQEEKEGEKAPGEDTPRMPGKSE 
GSSDLENTPGPDVEMNSQVDKVNDPT 
ESQPSCQA*RSRG 


270 


A 


401 


881 


ERRERSPDQSSGRASRGPPERQSLRMS 

PSRAAWTSSPCRSCASQGVCAWPLNL 

RRIASTSWC*PMSAGIGPMAWWPSTT 

GPCMMSTVSTMAKPHRECPGCFVPFA 

VCVVSRFPYYNSLKDCLSWHYRRPGA 

TLLSPSSLVTLLLVKGPGAAAADAGEI 

PV 


271 


A 


184 


581 


ASAPVGCLTRAVCGRPPWRTNTVVEP 
REGTRILEFGHLKLAHVPPLEFLVNOH 

x i J \_# Ji 1 1 tXJX VJ X X JL/ J.^ 1 tlXX X If X X XjXJX Xj V 1 1 \J X X 

QPEDHVLIKRWKEEKLEPAWEGPYPV 
LLTTKTAVRT/DKKKKKKKKRWTHHT 
QVKKVPPPPESWAIVPGENPTKLKLRK 
M 


272 


A 


1 


3802 


MRRGGWRKRAENDGWETWGGYMAA 
KVQKLEEQFRSDAAMQKDGTSSTIFSG 
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SEQ 

ID 

NO: 



M 

e 

t 

h 

o 

d 



Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 



Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
of amino acid 
sequence 



Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P-Proline, Q=Glutamine, 
R=Arginine, S=Serine, T=Threonine, 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 



VAIYVNGYTDPSAEELRKLMMLHGGQ 

YHVYYSRSKTTHIIATNLPNAKIKELK 

GEKVIRPEWIVESIKAGRLLSYIPYQLY 

TKQSSVQKGLSFNPVCRPEDPLPGPSNI 

AKQLNNRVNHIVKKIETENEVKVNGM 

NSWNEEDENNDFSFVDLEQTSPGRKQ 

NGIPHPRGSTAIFNGHTPSSNGALKTQD 

CLVPMVNSVASRLSPAFSQEEDKAEKS 

STDFRDCTLQQLQ Q STRNTD ALRNPHR 

TNSFSLSPLHSNTKINGAHHSTVQGPSS 

TKSTSSVSTFSKAAPSVPSKPSDCNFIS 

NFYSHSRLHHISMWKCELTEFVNTLQR 

QSNGIFPGREKLKKMKTGRSALVVTD 

TGDMSVLNSPRHQSCIMHVDMDCFFV 

SVGIRNRPDLKGKPVAVTSNRGTGRAP 

LRPGANPQLEWQYYQNKILKGKAADI 

PDSSLWENPDSAQANGIDSVLSRAEIA 

SCSYEARQLGIKNGMFFGHAKQLCPN 

LQAVPYDFHAYKEVAQTLYETLAS\YT 

HNIEAV S CDEAL VDITEIL AETKLTPDE 

FANAVRMEIKDQTKCAASVGIGSNILL 

ARMATRKAKPDGQYHLKPEEVDDFIR 

GQLYTNLPGVGHSMESKLASLGIKTCG 

DLQYMTMAKLQKEFGPKTGQMLYRF 

CRGLDDRPVRTEKERKSVSAEINYGMR 

FTQPKEAEAFLLSLSEEIQRRLEATGM 

KGKRLTLKIMVRKPGAPVETAKFGGH 

GICDNIARTVTLDQATDNAKIIGKAML 

NMFHTMKLNISDMRGVGIHVNQLVPT 

NLNPSTCPSRPSVQSSHFPSGSYSVRDV 

FQVQKA\KKSTEEEHKEVFRAAVDLEI 

SSASRTCTFLPPFPAHLPTSPDTNKAES 

SGKWNGLHTPVSVQ\SRLNLSIEVPSPS 

QLDQSVLEALPPDLREQVEQVCAVQQ 

AESHGDKK\KEPVNGCNTG'ILPQPVGT/ 

MSLLQ1P\EPQESNSDAGINLIALPAFS\Q 

VDPEVFAALSVAELQRELKAAYDQRQR 

QGENSTHQQS\ASASVPKNPLI\HLKAA\ 

VKEKKRNKKKKTIGSPKRJQSPLNNKL 

LNSPAKTLPGACGSPQKLIDGFLKHEG 
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SEQ 
ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 

HW1U 1 WtJlUUXs 

of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=GIutamic Acid, 
F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucme, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 

gllllUC, O OCI 111 V) X X Ul CUUlllC, 

V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible nucleotide 
insertion 










PPAEKPL/EKNSSGFLLSGVPG\LSSLQS 
DP/SLGOVRPPPPYNT \AGAVFFTsTnWTT 

LR\EWVTTISDPMEEDILQ\WKYCTDL 
IEDKDLEKLDLVIKYMKRLMQQSVES 
VWNMAFDFILDNVQWLQQTYGSHIK 
SYINITQRA 


273 


A 


1 


785 


MAETEERSLDNFFAKRDKKKKKERSN 

RAASAAGAAGSAGGSSGAAGAAGGG 

AGAGTRPGDGGTASAGAAGPGAATK 

AVTKDEDEWKELEQKEVDYSGLRVQ 

AMQISSEKEEDDNEKRQDPGDNWEEG 

GGGGGGMEKSSGPWNKTAPVQAPPAP 

VTVTFTPFPAMTSfrVVRPPfrART TTTP 

KTP\QGPPEI YQ * YHSSHPL AVNLPKHV 
ESRKDKEMEKSFEVVRHKNRGRDEVS 
KNQALKLQLDNQYAVLNENQKSSHSQ 
YN 


274 


A 


463 


828 


HT PIT Pn^RTH^YFT T<IT PRTJMWPWm 

■Tii-'iVlJUIVJL/OIV.inO I PL I OL>i\OJC_<lNlNl VV 1 U 

QSPCAAASRAQHLHPAAVAAATMPKT 
KAEGDAKGDKAKVKDEPQVTRAAIQT 
NTFIFKC*IEPQKQIYILYIQNSCQISLLI 
LPKSTLMKWMQTL 


275 


A 


3 


1901 


SSVEQASVEVPDGPTLHDPDLYIEIVKN 

TKSVPEYSEVAYPDYFGHIPPPFKEPIL 

ERPYGVQRTKIAQDIERLIHQSDIIDRV 

VYDLDNPNYTIPEEGDILKFNSKFESGN 

LRRVIQIRKNEYDLILNSDINSNHYHQ 

WFYF\EVSGMRPGVAYRFNIIN\CE\RC 

NRLFNYGMQPLMYSVQEALNARPWW 

IRMGTDIRYYINHFSRSSVAAGGA/QRG 

KSYYTITFTVQFST*RMDVCYFA/YIHY 

PYTY\STLQMHLQKLESAHNPQQIYFR 

KDVLCETLSGNSCPLVTITAMPESNYY 

EHICHFRNRPYVLMYARVHPG\ET\NAS 

WG YERERWE YLHE AINPTGFRSLRRN 

LYY/IFKIVPMLNPDGVINGNHRCSLSG 

EDLNRQWQSPSPDLHPTIYHAKGLLQY 

LAAVKRLPLVYCDYHGHSRKKNVFM 

YGCSIKETVWHTNDNATSCDWEDTG 

yrtlpkilshiapafcmsscsfwek.sk 
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SEQ 

ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E-Glutamic Acid, 
F=Phenylalanine, G=Glycine, 
EKHistidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
R = Arffinine. S=Serine. T = Threonine. 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe nucleotide 
insertion 










ESTARVVV\*RE1GVQRSYTMESTLCGC 

DOGKYKGLOIGTRELEEMGAKFCVGL 

L\RLKRLTSPLEY\NPALPSPALTFENDL 

N*IQACKVTSPYPLMSLDEDEP\RF\LEE 

VDYSAESNDELDIELAENVGDYEPSAQ 

EEVLSDSELSRTYLP 


276 


A 


1076 


3511 


IKALSSSAEDASLVNASISSSVKATSPV 

KSTTSITDAKSCEGQNPELLPKTPISPLK 

TGVSKPIVKSTLSQTVPSKGELSREICL 

QSQSKDKSTTPGGTGIKPFLERFGERC 

QEHSKESPARSTPHRTPIITPNTKAIQER 

LFKQDTSSSTTHLAQQLKQERQKELAC 

LRGRFDKGNIWSAEKGGNSKSKQLET 

KQETHCQSTPLKKHQGVSKTQSLPVTE 

KVTENQIPAKNSSTEPKEVIREIEMSVD 

DDDINSSKVINDLFSDVLEEGELDMEK 

SQ/AGDGSSISPv/TAAKNRKMH*ISPQC 

LYLHHWHKQLV* V* CPHLDWN*KTPA 

EVMKVQNQENSKELVS/RRAESGDSLG 

SEDRDLLYRSQRFKETERPS1KQVIVRK 

EDVTSKLDEKNNAFPCQVNIKQKMQE 

LNNEINMQQTVIYQASQALNCCVDEE 

HGKGSLEEAEAERLLLIATGKRTLLIDE 

LNKLKNEGPQRKN*G*S/APSEFIAIPKD 

QFTLSEIRLP*KADFVCSTVQKPDAAN 

YYYLIILKSRS\ENMVATPLASTSNSLN 

GDALTFTTTFTLQDVSNDFEINIEVYSL 

VQKKDPSGLDKKKKTSKSKKSNIHSSV 

MASPGGLSAVRTSNFALVGSYTLSLSS 

VGNTKFVLDKVPFLSSLEGHIYLKIKC 

QVNSSVEERGFLGCPGGGRLQPKRQTI 

FEDVSGFGAWHRRWCVLSGNCISYWT 

YPDDEKRKNPIGRINLANCTSRQIEPAN 

REFCARRNTFELITVRPQREDDRETLVX 

TNAGTHSVFTKNWLSADTKEERDLW 

MQKLNQVLCDIRLWQPDACYKPIGKP 


277 


B 


1 


2319 


MQINETIWDTVGAASRHGEGERQAKS 
STRGCTHLAEGQGIYLQEEQSPPEMCT 
RVMEKREGLTIERERDPLLPVWKALGI 
Q AHKC V AHTTN P SKATA VHLPHLTMQ 
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SEQ 
ID 

NO: 


M 

e 

t 

h 

o 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 

o /> i rl 
itClil 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
«icju resiuue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
IV— Arginine, o— serine, i—inreonine, 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 










PQGCLMSFFPTAAEFSTYGQELYLENN 

QIEEITEICFNHTPvKINVIVLRYNKIEEN 

RIAPLAWINQENLESIDLSYNKLYHVPS 

YLPKSLLHLVLLGNQIERIPGYVFGHM 

EPGLEYLYLSFNKLADDGMDRVSFYG 

AYHSLRELFLDHNDLKSIPPGIQEMKA 

LHFLRLNNNKIRGNKQEIKQTSKQASA 

VQSEKWVTMRRAHWGLRAARRLRPP 

STAWINSRSRPVPVEQTHCGLAVAEER 

KDLFMFFRSLHFFVEWFEYRKRTFKHL 

KWDEDYDQEPDDDYQTGFPFRQNVD 

YGVPFHQYTLGCVSECFCPTNFPSSMY 

CDNRKLKTIPNIPMHIQQLYLQFNEIEA 

VTANSFINATHLKEINLSHNKIKSQKID 

YGVFAKLPNLLQLHLEHNNLEEFPFPL 

PKSLERLLLG YNEI SKLQTN AMD GL VN 

LTMLDLCYNYLHDSLLKDKIFAKMEK 

LMQLNLCSNRLESMPPGLPSSLMYLSL 

ENNSISSIPEKYFDKLPKLHTLRMSHNK 

LQDIPYNIFNLPNIVELSVGHNKLKQAF 

IDPLHYHHLTYIRVDQNKLKEPISSYIFF 

CFPHIHTIYYGEQRSTNGQTIQLKTQVF 

t> p ppnrmTYE CT7nunnpn\T a tjtt c dt; r\t? 
Ivtvr r U D ULf&or,lJri JJJL'rlJlNAxiJDoriil s /Jb 

GAEGHFDLHYYENQE 


278 


A 


65 


262 


SRRRGGVSAPTSFYGRDRRMFPAQEE 

ADRTVFVGNLEARVREEILYELFLQVL 

CPREMGILSISP 


279 


A 


1 


892 


MSRWGAAVGQGALREEHFAHAHITER 

TRRVREGRRKRRSSLLTTSPTSANAQA 

HFLKLKVSIDKGPQNRAGAIVPWFAK 

MSFPKYKP\SSLRTLP\ETLDPAEYNISP 

ETRRAQA\ERLAHR\AQL\KREYLLQYN 

DPN\RRGLIENP\ALLRWAYARTINVY\P 

NFKP\TPKSSLMGAFVWDFGPLIFI\YYII 

KTERWDPNORWLTDSRII KYFATT T PR 

DDLTLTTDNSLNPAAFLRGNPNPEEPE 
HKCLDLISYQTRVRLDLSKTPFQTGRH 
LFIDGSSLVIGGKGHNGYSVVDGETLT 
K 
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SEQ 

ID 

NO: 



280 



M 

e 

t 

h 

o 

d 



B 



Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 



1 



Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
of amino acid 
sequence 



597 



Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
R=Arginine, S=Serine, T=Threonine, 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 



MQTFTTCISYSEYSCMLLANASSHGTL 

YCKLRVGICLLMVPAVKNQASGSARG 

ATKVRRKCQAGCQNEHLGELDDGTD 

GKNQLN1RENGGRGQNCEQELEESVA 

EKDLSQTSRDLEKMMSKHFLKPMLSI 

SDLVNFLMQVSKVLVKTAEGIVLQQL 

PLAFPALHFHAYGNLFPVCSFKHYIYM 

IDHPIFISIPDFLT 



281 



4061 



MPVPSRHINIGRSQSWDAAGWYEGPW 

ENAESLRPLGPvRSSLTYGTAEGTWFEP 

NHRPQDAALPVAAEPYLYREAVYNSV 

AARKGSTPDFTFYDSRQAVMSGRSPLL 

PREYYSDPSGAARVPKEPPLYRDPGVS 

RPVPSYGVLGSRTSWDPMQGRSPALQ 

DAGHLYRDPGGKMIPQGRQTQSRAAS 

PGRYGREQPDTRYGAEVPAYPLSQVFS 

DISERPIDPAPARQVAPTCLVVDPSSAA 

APEGSTGVAPGALNRGYGPARESIPSK 

MAYETYEADLSTFQGPGGKRTVLPEFL 

AFLRAEGLAEATLGALLQQGFDSPAVL 

ATLEDADIKSVAPNLGQARVLSRLANS 

CRTEMQLRRQDRGGPLPRARSSSFSHR 

SELLHGDLASLGAAAPLQTASPRAGDP 

ARRPSSAPSQHLLETAATYSAPGVGTH 

APHFPSNSGYSSPTPCALTARLSPTYPL 

QAGVALTNPGPSNPLHPGPRTAYSTAY 

TVPMELLKRERNVAASPLPSPHGSPQV 

LRKPGAPLGPSTLPPASQSLHTPHSPYQ 

KVARRTGAPIIVSTMLAPEPIQFAGQA 

VQSDNVRKAYAAGTPVRPTSPGDTDK 

WGLQARAPGRAVDPRNMfSAQEHKV 

VECMARRSATCFVFGQLCRLHSTSSDP 

VGVDFILSMEDVGRGKSRNPDSWSPN 

AVVWDASGVGGERVLQYQLDMNTVP 

PQGWTTRKTRVCCKHEASPSPISALAA 

IAKEEGVILLLWTFTLGNKRLGGSATR 

VGYAEAQAEAPSCKATTVTLSSGSSHE 

CDSSVSSKTATCRDFMGQPWGHASIPP 

TPNPPPPAVVPGIFSQHENPLAFLFSRL 

AMKDLLPGFEPQTLDRSRASLSHVLRA 
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SEQ 

ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 

UtlU J CiSlUuv 

of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K==Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
xv /\rginme 9 o — oenne, i — inreomne, 
V=VaIine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e nucleotide 
insertion 










RPSGRVEGIRPQIMNGPLHPRPLVALL 

DGRDCTVEMPILKDLATVAFCDAQST 

QEIHEKVLNEAVGAMMYHTITLTRED 

LEKFKALRVIVRIGSGYDNVDIKAAGE 

LGECEAALAAWSCPELCGPCSGGLGE 

AAGTGTTEQPLLAVARWLPPGRAVEH 

LAALPSHDTGIAVCNIPSAAVEETADST 

ICHILNLYRRNTWLYQALREGTRVQSV 

EQIREV AS GAAR1RGETLGLIGFGRTGQ 

AVAVRAKAFGFSVIFYDPYLQDGIERS 

LGVQRVYTLQDLLYQSDCVSLHCNLXN 

EHNHHLMNDFTIKQMRAGSIPLWNAA 

RGGLVDEKALAQALKEGRJRGAALDV 

HESEPFSFAQGPLKDAPNLICTPHTAW 

LRNCVNKEFFVTSAPWSVIDQQAIHPE 
LNGATYRYPPGIVGVAPGGLPAAMEGI 
IPGGIPVTHNLPTVXAHPSQAPSPNQPTK 
HGDNREHPNEQ 


282 


A 


29 


573 


LLKISGIILKTGESQNQLAVDQIAFQKK 
LFQTLRRHPSYPKI1EEFVSGLESYIEDE 

LIKLLLGIDILQPAIIKTLFEKLPEYFFEN 
KNSDEINIPRLIVSQLKWLDRVVDGKD 
LTTKIMQLISIAPENLQHDIITSLPEILGD 
SQHADVGKEL 


283 


A 


927 


5088 


KRKRRRTWKRYRSIIDHLQEKRREVTL 

RVDTYTLVQPEAEDHVESYRSMPIYPT 

YNEVHLDERPFLRPN1ISGKYDSTAIYL 

DTHFRLLREEIVRPLREGILELLQSFED 

QGLRKRKFDDIRIYFDTRIITPMCSSSGI 

VYKVQFDTKPLKFVRWQNSKRLLYGS 

LVCMSKDNFETFLFATVSNREQEDLCR 

GIVQLCFNEQSQQLLAEVQPSDSFLMV 

ETTAYFEAYRHVLEGLQEVQEEDVPF 

ORNIVECNSHVKEPRYLLMGGRYDFT 

PLIENPSATGEFLRNVEGLRHPRINVLD 

PGQWPSKEALKLDDSQMEALQFALTR 

ELAIIQGPPG\TGKTYVGLKIVQALLTN 

ESVWQISLQKFPILVVCYTNHAL\DQFL 
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SEQ 
ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue ui 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
oi amino aciu 
sequence 


Amino acid segment containing signal 
peptide (A=AIanine C=Cysteine, 
D=Aspartic Acid, E=GIutamic Acid, 
F=PhenyIalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=GIutamine, 
R=Arginine, S=Serine, T=Threonine, 
v- valine, w-i rypiopnau, i — tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 










GRHLQLSGRPGIVRVGWKGATVEIPEG 

SFTLRELRNKREFRKNLPMHLRRAYM 

SIMTQMKESEQELHEGAKTLECTMRG 

VLREQYLQKYISPPALGKSHEWPQCRI 

VNGFSSQHWKHSHDAGVA*VLVSVLS 

RKVFLQQDLRIQPQAEGDEEEEGEE/RE 

FRLIRDSQREADPDFKQTG*LRRKRW* 

GPSGGRRKRVEQTRS WLKCF WP* G *T1 

VALGQQLDRSKPQESGRPS/DNQKKK 

MKKRVKDELRKLNTMTAAE ANEI EDV 

WQLDLSSRWQLYRLWLQLYQADTRR 

KILSYERQYRTSAERMAELRLQEDLHI 

LKDAQVVGMTTTGAAKYRQILQKVEP 

RIVIVEEAAEVLEAHTIATLSKACQHLI 

LIGDHQQLRPSANVYDLAKNFNLEVSL 

FERLVKVNIPFVRLNYQHRMCPEIARL 

LTPHI YQDLENHPS VLK YEKIKG V S SN 

LFFVEHNFPEQE\SKRRKSHQNQHEAH 

NVVELCKYFLCQEYLPSQITILTTYTGQ 

LFCLRKLMPAKTFAGVRVHVVDKYQ 

GEENDIILLSL VRSNQEGKVGFLQI SNR 

ICVALSRAKKGMYCIGNMQMLAKVPL 

WSKIIHTLRENNQIGPMLRLCCQNHPE 

THTLVSKASDFQKVPEGGCSLPCEFRL 

GCGHVCTRACHPYDSSHKEFQCMKPC 

QKVICQEGHRCPLVCFQECQPCQVKVP 

KTIPRCGHEQMVPCSVPESDFCCQEPC 

SKSLRCGHRCSHPCGEDCVQLCSEMV 

TIKLKCGHSQPVKCGHVEGLLYGGLL ! 

VKCTTKCGTILDCGHPCPGSCHSCFEG 

RFHERCQQPCKRLLICSHKQQKPCIGE 

CPPCQRTCQNRCVHSQCKKKCEELCSP 

CVEPMCSRCQHYQCTKLCSEPCNRPPC 

YVPCTKLLVCGHPC1GLCGEPCPKKCR 

irHMnPVTOTPFnppnppnA pirvrtt pn 
iv^nivii_yn v i v^irrijrxii^cr u/\t\X vyLtu 

CSHMFEVQALDRYMNEQKDDEVAIRL 

KVCPICOVPIRKNLSYGTSIKORLEEIEII 

EEKYPGLIRGNGNQPGTA 


284 


A 


381 


2040 


AERKLSEKSLVVAAVAPDNRNPAFTT 
MGWLFLKVLLAGVSFSGFLYPLVDFCI 
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SEQ 
ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 

dvlU rCMUUv 

of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=GIycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
iv— /\rgmine, o — serine, i — inreonine, 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 


. -. 


/.; . 






SGKTRGQKPNFVIILADDMGWGDLGA 

NWAETKDTANLDKMASEGMRFVDFH 

AAASTCSPSRASLLTGRLGLRNGVTRN 

FAVTSVGGLPLNETTLAEVLQQAGYV 

TGIIGKWHLGHHGSYHPNFRGFDYYF 

GIPYSHDMGCTDTPGYNHPPCPACPQG 

DGPSRNLQRDCYTDVALPLYENLNIVE 

QPVNLSSLAQKYAEKATQFIQRASTSG 

RPFLLYVALAHMHVPLPVTQLPAAPR 

G/RKSLYGAGLWEMDSLVGQIKDKVD 

HTVKENTFLWFTGDNGPWAQKCELA 

GSVGPFTGFWQTRQGGSPAKQTTWEG 

GHRVPALAYWPGRVPVNVTSTALLSV 

L\DIFPTWALAQAS\LPQGRRFDGVDV 

SEVLFGRSQPG\HRVLF\HPNSGAAG\D 

STGPELQHKFPLIFNLEDDTAEAVPLER 
GGAEYQAVLPEVRKVLADVLQDIAND 
NISSPDYTQDPSVTPCCNPYQIACRCQA 
A 


285 


A 


1 


885 


PVATTISQPLSLEADMWSIGVITYILLS 

GASPFLGDTKQETLANITAVSYDFDEE 

FFS\ETSELAQDF1RKLLG*ETRKRVTIQ 

EALRHPWITSKGEGRAPEQRKTEPTQL 

KTKHLREYTLKCHSSMPPNNCYVNFE 

RFACVVEDVARVDLGCRALVEAHDTI 

ODTiVPAT VQT'FWF'K'F A WVRTkPMUC AT? 

KfiJXJ V H/rtJU V OlrlNXZ/JSJDrV W 1 ivUljiN JDO/VJK 

HDLSQLRYEFRKVESLKKLLREDIQAT 
GCSLGSMARKT DHT OAOFFTT PfYFT <?A 
DLOWIOELVGSFOLESGSSFGT G9TFY 
QDTSESLSELLSRSCTEEFLAGWKL 


286 


C 


187 


342 


MVPVFSVEKDGEELGSFRPRWADWLT 
GLLEWVSVESLSIYCISQPVYMWVE 


287 


C 


188 


207 


MWHLSV 


288 


A 


153 


503 


WPT-TSPTYPOSAT ftS'S'SnftWT PAPT QPr*P 

G*AGAGGGRRCRGRPWSRAG*ACSGH 
AGSRCCPA* SVCGGLPGGAPGCLCKG 
GSAGFCCQGPGCSCSGCSGSGHGGYR 
HRQGRPLSASQ 


289 


A 


1 


4964 


SVYKADLEWLRGIGWMPEGSVEMNR 
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SEQ 
ID 

NO: 



M 

e 

t 

h 

o 

d 



Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 



Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
of amino acid 
sequence 



Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=PhenyIaIanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
R=Arginine, S=Serine, T=Threonine, 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 



VKVAQDLVNERLYRTRPEALSFTSIVD 

TPEVVLAKANSLQISEKLYQEAWNKD 

KSNITIPSDTPEMLQAHINALQISNKLY 

QKDWNDTKQKGYDIRADAIEIKHAKA 

SREIASEYKYKEGYRKQLGHHMGFRT 

LQDDPKSVWAIHAAKIQSDREYKKAY 

EKSKGIHNTPLDMMSIVQAKKCQVLV 

SDIDYRNYLHQWTCLPDQNDVIQAKK 

AYDLQSDPLYRNAWEKEKANVNVPA 

DTPLMLQSKINALQISNKRYQQAWED 

VKMTGYDLRADAIGIQHAKASRDIAS 

DYLYKTAYEKQKGHYIGCRSAKEDPK 

LVWAANVLKMQNDRLYKKAYNDHK 

AKISIPVDMVS1SAAKEGQALASDVDY 

RHYLHHWSCFPDQNDVIQARKAYDLQ 

SDTEPCSLAQAGVQWVADMTARGQSP 

LAPLLETLEDPSASHGGQTDAYLTLTS 

PvMTGEEGKEVITEIEKKLPRLYKVLKV 

SSIIDSLEILFNKGETHSAWDFEALNVI 

VRLIEQAPIQMGEEAVRWAKLVIPLW 

HSAQKVHLRGATALEMGMPLLLQKQ 

QEIASITEQLMTTTLHRSGSFINSLLQLE 

ELGFRSG APMIKKIAFIA WKSLIDNF AL 

NPDILCSAKRLKLLMQPLSSIHVRTETL 

ALTKLEVWWYLLMRLGPHLPANFEQ 

VCVPLIQSTISIDSNASPQGNSCHVATS 

PGLNPMTPVHKGASSPYGAPGTPRMN 

LSSNLGGMAHPSIQLLGLEMLLHFLLG 

PEALSFAKQNKLVLSLEPLEHPLISSPSF 

FSKHANTLITAVHDSFVAVGKDAPGN 

KKEKPGSEVLTLLLKSLESF/KSEVFPV 

SKTLGTPALFLIQLIFNNFLECGVSDER 

FFLSLESLVGCVLSGPTSPLAFSDSVLN 

VINQNAKQLENKEHLWKMWSVIVTPL 

TELINQTNEVNQGDALEHNFSAJYGAL 

TLPVNHIFSEQRFPVATMKTLLRTWSE 

LYRAFARCAALVATAEENLCCEELSSK 

IMSSLEDEGFSNLLFVDRJIYIITVMVDC 

IDFSPYNIKYQPKVKSPQRPSDWSKKK 

NEPLGKLTSLFKLrVKVIYSFHTLSFKE 
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SEQ 

ID 

NO: 


M 

e 

t 

h 

o 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 

AVIV* 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 

uVlU ICdlUUv 

of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
jcv— /\rgimnc 9 o — oenne, l — i nreomne, 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e nucleotide 
insertion 










AHSDTLFTIGNSITGIISSVLGHISLPSMI 

RKIFATLTRPLALFYENSKLDEVPKVY 

SCLNNKLEKLLGEIIACLQFSYTGTYDS 

ELLEQLSPLLCnFLHKNKQIRKQSAQF 

WNATFAKVMMLVYPEELKPVLTQAK 

QKFLLLLPGLETVEMMEESSGPYSDGL 

KLESSSLKVKGEILLEEEKSTDFVFIPPE 

GKDAKERILTDHQKEVLKTKRFEEQM 

DSDIVIPQDVTEDCGMAEHLEKSSLSN 

NECGSLDKTSPEMSNSNNDERKKALIS 

SRKTSTECASSTENSFVVSSSSVSNTTV 

AGTPPYPTSRRQTFITLEKFDGSENRPF 

SPSPLNNISSTVTVKNNQETMIKTDFLP 

KAKQREGTFSKSDSEKIVNGTKRSSRR 

AGKAEQTGNKRSKPLMRSEPEKNTEE 

SVEGIWLENNPPGLLNQTECVSDNQV 

"PT ^PQTA/fT7T4TYhJTT<rT V A A TVENT AA7T T XX 

TNTVEEKNVEINLESKENTPPVVISADQ 
MVNEDSQVQITPNQKTLRRSSPvRR/YR 
SSRVYH*KPR*GK* SSKKGTT*GRRKTS 
SEESIAYKR 


290 


A 






YTi A VTV/fFK"Tf fTT T AT AT VT7QA/TPVT? A AT7TJ 

WIDVRVPEQYQQEHVQGAINIPLKEVK 
ERIATAVPDKNDTVKVYCNAGRQSGQ 
AKEILSEMGYTHVENAGGLKDIAMPK 
VKG 


291 


A 


2 


359 


S^P^PT-TT VK'tfTKTK' MTf^PAT POT <3Pnu 

TKSPVTFWWMTFGDTSRPSQDTLPMD 
LQQLLGVTKVCSKATSPTSQRGQEVIS 
TPTSKSGPFIGRGS*G*SGRWERPSCCL 
HFSYPQLRGLC \ 


292 


A 


834 


1913 


REPAGAGAYMRACARVRRRGDRRPR 

RSPRPRDPAVRARARSAPPPLFIAAAG 

GGGSGWRLYADSGEEYGIMAFALFVL 

LG\FALLGTHGA\SGAAGTVFTTVEDL\ 

GSKILLTC\SLNDSATEVTGHR\W\LKG 

GWLKE\DALPGQKT/ESFKVDSDDHV 

GMKYS\CVFLP\EPHGHGPTIQASTGPP 

RVEGL* SSFRTHSTRGKTGLVGSCK\SE 

FVPPWTDW/APWYKITDSEPQGPSLNA 



139 



WO 01/74836 



PCT/USO 1/10472 



SEQ 

ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=PhenyIalanine, G=Glycine, 
ENHistidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
R—Arffinine. S=Serine T=TTirpnnino 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 










ORTRFFWGPS AGPVK S YOT-T *FPFmrrP 

PPARNRCNGTSSKGLRPRPLQFLRVRT 

AT*AAL\WPFLGIVG\EVLVL\VTI\IFIYE 

KRRK\PEDVL\DDDDAGSAPLKE/SAGQ 

HQNVDKGKKRSARGNFS 


293 


A 


1 


936 


MKVLLESVKERAEEEKLAAAHLRSFA 

AKKAKKYDSVKKEKTLQDVDLTQHQ 

HKQTRALSGGLKRKLSLGIAFMGMSR 

TVVLDEPTSGVDPCSRHSLWDILLKYR 

EGRTIIFTTHHLDEAEALSDRVAVLQH 

GRLRCCGPPFCLKEAYGQGLRLTLTRQ 

PSVLEAHDLKDMACVTSLIKIYIPQAFL 

LDENLHQLHLTGYGISDTTLEEAEGRT 
AAPEPPMLEDGHAVTQRFSFIQWGCE 
DDRTTWVQAQGASAPGGQRPQEDLPS 
FPQDGRSRAQFKDPHQFSN 


294 


A 


1 


1743 


MASHAYDKNQNANVLVHLCFYNRJPK 

TGAYYLDSRSVSISYLIGHHIDMGLET 

ATSKNEFIFDSASTLLGMLFRKPSQHSL 

SLFSKKFQENLIYLESDDCLPPPPPPPW 

SEPPSFLTWTTVTVFQWVSLLLSLPNIQ 

VILYRAVGVVPSQPKSDNLKGWGSGR 

WKEKLRSEIPDWKIKSIHBLERTASSST 

EPSVSRQLLEPEPVPLSKEADSWEIIEG 

LKIGQTNVQKPDKHEGFMLKKRKWPL 

KGWHKIQKGKVHGSIDVGLSVMSIKK 

KARRIDLDTEEHIYHLKVKSVFNSFSAI 

IRGNDLPTPWKSQDWFDAWVSKLRH 

HRLYRQNErVRSPRDASFHIFPSTSTAE 

SSPAANVSVMDGKMQPNSFPWQSPLP 

CSNSLPATCTTGQSKVAAWLQDSEEM 

DRCAEDLAHCQSNLVELSKLLQNLEIL 

QRTQSAPNFTDMQANCVDISKKDKRV 

TRRWRTKSVSKDTKIOLOVPFRATM^P 

VRLHSSNPNLCADIEFQTPPSHLTDPLE 
SSTDYTKLQEEFCLIAQKGKGASKKQA 
KRNAAEKFLAKFSNISPENfflSLVSNVD 
S YD VNVIKHFLQ 


295 


A 


1 


1248 


MLRTRKAPHSWVKSSSNTVHYRVSVV 
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SEQ 

ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, 
HNHistidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
jx avi gm m e ? o — ocimc, jl — xnreomne 9 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible nucleotide 
insertion 










CLHDHVTD WEWQLTAT ARHPKRV SH 

YILWDQEKTKIKIRKDIIRILPSLDVEVK 

DITDSYDANWFLQLLSTEDLFEMTSKE 

FPIVTEVIEAPEGNHLPQSILQPGKTIVI 

HKKYQASR1LASEIRSNFPKRHFLIPTS 

YKGKFKRRPREFPTAYDLEIAKSEKEP 

LHWATKAFHSPHDKLS SVSVGDQFL 

VHQSETTEVLCEGIKKVVNVLACEKIL 

KKSYEAALLPLYMEGGFVEVIHDKKQ 

YPI SELCKQFPvLPFNVKVSVRDLSIEED 

ECWEIPVGRLNMTVQLVSNFSRDAEPF 
LVRTL VEEITEEQ YYMMRJRYES S ASHP 
PPRPPKHPSVEETKLTLLTLAEERTVDL 
PKSPKRRR 


296 


A 


1 


906 


MFAFEPLGGCPvPWRLSLPGLGSRLFRT 

YGAADGRRQRRPGREAAQWFPPQDR 

RRFFNSSGSSDARMGDPSQSDDPDDPD 

DPDFPGSPVRRRRRCPGGRVPKDRPSL 

TVTPKRWKLRARPSLTVTPRRLGLRAR 

PPQKCSTPCGPLRLPPFPSRDSGRLSPD 

LSVCGQPRDGDELGISASLFSSLASPCP 

HLPEVSLDRASLPCSQEEATGGAKDTR 
MGSVRVLRDPVGVNLYEHSVSKCHVG 
QPDTDPREKVKAAPEELCLHALQHPRS 
EOADC 


297 


A 


574 


869 


QGAFWLLFSSPRSFFLLSVP/WWLPESS 
RWLLLHGKSQLAVQNLQKVAHRGDW 
PGSGHPAPQSQHSSLRRSAARSRPPCW 
ARRWRAPPHTPRVAGGSGC 


.298 


A 


225 


749 


ESVTFEDVAVEFIQEWALLDSARRSLC 
KYRMLDQCRTLASRGTPPCKPSCVSQL 
GORAEPKATFPGTT P ATCVAWF^OT l<r 

PEELPSMQDLLEEASSRDMQMGPGLFL 
RMQLVPSIEERETPLTREDRPALQDPP 
WSLGCTGLKAAMQIQRVVPVPTLGH 
RNPWVARDSGAIGNG 


299 


A 


1 


591 


PLPLDQRLLAS1TPSPSGQSIIRTQPGAG 
VHPKADGALKGEAEQSAGHPSEHLFIC 
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SEQ 
ID 

NO: 


M 

e 

t 

h 

o 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
aciu residue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=GIutamic Acid, 
F=PhenylaIanine, G=Glycine, 
H=Histidine, I=IsoIeucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
k— Arginme, 3— oenne, I— inreonme, 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible nucleotide 
insertion 










EECGRCKCVPCTAARPLPSCWLCNQR 

z™ 1 ! /->c A XT CT T ■Pk'V/~nV v r r*r^\rv r^l T?\n toot 

DDEDNCADEPCSCGPSSCFVRWAAMS 
LISLFLPCLCCYLPTRGCLHLCQQGYDS 
LRRPGCRCKRHTNTVCRKJSSGSAPFP 
KAQEKSV 


300 


A 


1 


1569 


MKTCLAFPLAFHHDSNGSAVLRLQAT 

HGISSDSPLPSVEKASHSPDPSEYFRKH 

PPVRRSGLRTKRTSPGPGARVPGSQSF 

RSAEACGVAALECWRRRVPVPLSSPA 

EVQVLLKKALRPESRPFRNQILHNCER 

NWGNKGWKGLVGRSESQTGQSEKLS 

MSSHHRGTVREELWEEYIGGWCLCG 

SAWKLLVTGLEQLLFSRTRPQEEAVD 

KTWRTARQLESGTLLCRHCITLPWPSE 

RNGGCFLSPSNMLVCELRVLSVIVASP 

EPSTEHTQEHLSGDEFEKSQPSRKEKSL 

GLLCHKFLARYPNYPNPAVNNDICLDE 

VAEELNVERRRIYDIVNVLESLHMVSR 

LAKNRYTWHGRHNLNKTLGTLKSIGE 

ENKYAEQIMMIKKKEYEQEFDFIKSYT 

0 VJNbKJsJJKoLKVMoi^JsJr VMLrLVSTP 

QIVSLEVAAKILIGEDHVEDLDKSKFKT 

GSLVRLFAPCLSGAGSKLPGLVEALAF 

n VoLAAro V VAoVJborJir VAJLbbWVVH 
TVGLRPWGGVTLWWA 


301 


C 


236 


481 


MDLIVYHKKSDISNQPSIPTCALFFPCV 
SLEPFQLFPVKQTARRPPPYSSPGKSTG 
NVIPFGHGFPTLQPKXQITPVGGQY 


302 


A 


1 


1755 


MRQTKTEYIQEFNQEATVARALEGQE 

KPTEGPRNTCLGSNNMYDIFNLNDKA 

LCFTKCRQSGSDSCNVENLQRYWLNY 

EAHLMKEGLTQKVNTPFLKALVQNLS 

TNTAEDFYFSLEPSQVPRQVMKDEDKP 

PDRVRLPKSLFRSLPGNRSVVRLAVTIL 

DIGPGTLFKGPRLG1 GDO^frVT "NTNTRT V 

GLSVGQMHVTKLAEPLEIVFSHQRPPP 
NMTLTCVFWDVTKALVGGYDILAIYE 
VEHFQQEQCVAVI SVCSRGGKGS AEC 
GHWGKGLTTEHSSPVPAVTHLSLAR1A 
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SEQ 

ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=GIutamic Acid, 
F=PhenyIaIanine, G=GIycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=GIutamine, 
R==A rpinine S=Serinf k T=T , HrpnntnA 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e nucleotide 
insertion 










EKGKAGCPARACRVHSWVLVLSGKRE 

VPENFFIDPFTGHSYSTQDEHFLGIESL 

WNHKNYWINMQDCWNCCKVPREGEL 

GDPEERLAHL/SWWLGLSVHLHGGRS 

QSAAPELPSHHPVPAALMVYEPLPATT 

GLDL*PG*PCEMGVHAPGD**VSAVLD 

*RRRQWDKR*G*CGKSGQGG*G*ELR 

HAPLVWEOTFTSPFfiTMTT FTTf FWYfYIMP 

EDMLELKHINKTTDLKTDYFKPGHPQ 
ALRVHSYKSMQPEMDRVIEFYETARV 
DGLMKREETPRTMTEYYQGRPDFLSY 
RHA 


303 


A 


3 


1376 


GDQKVHPFSTPSPGTPAFHIPTTFSPAA 

GPGHHLPMDPGEGLAEGPGLP/GSSG* 

RPL*VPSRRASHCP\PGATKARGGRCR 

GPAATTG*AACAGRTAAPG*PGASPPA 

AQALHHSLQEPGEHRGRPGPSASAPSA 

GTVDQVGGGAERMPTTPGPRHAVGEC 

GPTCSASLRGPL*PLPNLAAPAQWGSH 

QLQGEEQIQVPSCCFAPGIQRLLPRPQT 

QEPGF*TQTPDPGLKPQDSKPRLPGLQ 

TQTPDPGPRTRADGFPDQRGPV\GQGQ 

WEGAPGGHTLGNSGGSCLA/GPPW*RS 

EGHEECPSSCQSQFGELRLWLPRGGW 

AEGVSAGSHGPPWPAGPAPPGPQPLG 

WDAGPHFPEESRTRPGPDPFP* K nHfiT 

VL*LTQRKHRDGHKEPRTKJQLPVPGA 
EGOTCPPEPWAGAHRRNANWOAOGS 
RRERPSGFQTPRSHWVPSAGRSGFLGP 
QFSCL 


304 


C 


215 


343 


MSGRVFRCQALVAYTVLSELFTEAKE 
QRLATDEGQKEFSAES 


305 


C 


215 


339 


MSGRVFRCQALVAYTVLSELFTEAKE 
QRLATDEGQKEFSAE 


306 


A 


2 


2483 


GKYYKLSSGTAPTCVSLGWGLARGDS 
AAPALGSRTSACAPCSHGTWKLSLEPS 
DRLSPCDRSSEEAHTHAPHRLLALVAS 
LPWSRLPLLAPQSHSEAEATSQPTGVE 
NHHQKTRYVKAGGPVICRSLPESRGFL 
WASEGRKCMLIGSWAAMGRLRKSTIS ! 
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SEQ 
ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=AIanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
R— Ar^inine S=Serine T=T , firpAnin a 
V=VaIine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible nucleotide 
insertion 










SRFGPQTLAGTGRPQAIPVLKKHSDAV 

LLGVCFLKLLHQHHQELGENADSQTL 

PQTHWEFILSEDYNKMTPVKNYQVLE 

VLARAMRQEKQIKSIQLGKEEVKLSVF 

ADDMIVYLENPIVSAQNLLKLISNFSK 

VSGYKINVQKSQAFLYTNNRQTESQIIS 

ELPFTIPSKRIKYLGIQLTRDVKDLFKE 

NYKPLLNEIKEDTNKWKNIPCSWVGRI 

MMKMAILPRVIYIFNAISIKLPMTFFTE 

LEKTTLKFIWNQKRARIAKTILSQKNK 

AGGITLPDFKLYYKATVTKTAWYWY 

QNRGVDQWNRIEPSEIIPHIHNHLIFDK 

PDKNKKWGKDSLFTKWCWENWLAIC 

RKLKLDPFLTPYTKINSTWIKDLNVRP 

KTIKTLEENLGITIQDIGMGKDFMSKTP 

KAMATKAKIDKWDLIKLKSFCTAKET 

TIRVNRQPTEWEKIFTIYPSDKGLIPRJY 

KELKQNLQEKIKQPHQKVGKGYKQTF 

LKRRHLCSQQTHEKMFIITGHQRNAKQ 

NHNKIHLTPVRMAIIKKSGNNRDMDE 

AGNHHSFOTTARTFNOAPYT T THPWF 

LNNENTWTQVEEHHTLGPIVGVICRKV 
FPGNSGPSKPSGLHFSQPLPQVTSVVA 
KITIVPWEMKLIAMGVQDELNIAFHKN 
HLLMNDTTIHMTPYIQPAPKS 


307 


A 


1532 


1937 


TPLPVCHFTCRKNHLKGMENLCLHKK 
CMWMSTVAFSII AKTWKOPR PP9 AAPS 

AVKQPTHLTTGDWANGLG*FSTREYVT 
A*ERTNQSKPDTTTWVNLTDVQLSNSS 
QAPRGVSTTLQFPVLGTVDKSGVTMT 
FWV 


308 


A 


1 


939 


MGNKTYGGQNQMLIFAFTLHSLFLNS 

GDGRLSFESSSQKPGGFRNIAIQTSPSL 

PxKHFPVFKRKRLTASKSVEEMPTASQS 

AIHVNGNLSEQDIVSSDLAYLRLAQHL 

EDGPRRVKVSHAFLPRVPKVQSNGPVS 

ICLEAGTWRSLEKATAAIQVPDDIYHS 

PSWEARESALSPDRSAEHNSLSRPSDP 

GLSLQPQLLPTLCLPFHVLYTRSPQSLG 

HGPIAVHGLLGTMLRSRRTWSFLYPGF 
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SEQ 

ID 

NO: 


M 

e 

t 

h 

o 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 

AV1U 

residue of 
amino acid 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 

dtlU ICMUUC 

of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=GIutamic Acid, 
F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
xv — Argwiiiic, o — ocrnie, a — lnreomne, 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /=possib!e 

UUVlbUUUC UC1CUV11) \ UU531UIC IIUCICULIUC 

insertion 










LPWCSGRIGSRVGLENECKVSLSGSSS 
QPMGEPEGRWSSPEVGPLASPGSPL1A 
WAKLRFVPPVDDLPV 


309 


A 


1 


1528 


MDSQEVEKYPNTSVACEEIPFSGIHVA 

GGKSGALEHGKDDLDEPIENPLFCFSSF 

SNALAILLPKVFLKMHILQFIYRSFHLL 

TMAKAKFEGAESVEPVSPSQPKRPSYV 

PLEELWTRLTKGNSRPQQRDREKGGW 

MKGVQQGHQGVGKQEEGSENIKEKA 

GIVVCEVPNNKLDKFMGILSWKDSKH 

SLNNEKIILRGCILRNTSWCFGMVIFAG 

PDTKLMQNSGKTKFKRTSIDRLMNTL 

VLWIML1SQPWEFIMRGHSYFINWDR 

KMYYSRKAIPAVARTTTLNEELGQIEY 

IFSDKTGTLTQMMTFKRCSINGRIYGE 

VHDDLDQKTEITQLIHRWLARLKKKK 

REKN QTDTIKNDKGNITTDL AETQTTI 

PPWTTITr VTKTfcTT TTMT PPA/fTYtfTTT nAVT 
JVC I I JSJlUU i IlNiSJLlMNl^IlJilV^ 1 

LSRLNQEEVESLSRPITSSEIEAVINSLP 
TKKSPGPDRFTAELYQKYKEELEKEPV 

DPKVHEFLRLLALCHTVMSEENSA 


310 


A 


104 


315 


DWTVGFVGNSDTELPGSVGRRSLWES 

SYSTRTRNQGRQAIQIQHS*LREVERKS 

GQKATMSSGGGYCQPE 


311 


A 


271 


1020 


AIRQEKEIKGIQLGKEEVKLSLFADDMI 

LYLENP1VSAQNLLKLISNFSKVSGYKI 

NVQKSQAFLYTNNRQTESQIMSELPFTI 

ASKRIKYLGIQLTRDVKDLFKENYKLL 

LKEIKEDRNKWKNIPCSWVGRINMVK 

MATT PK VTVCiFNTATPTTO PMTFFTFT WT 

TSKFIWNQKRAHIAKSILSQKNKAGGI 
TPPDFKLYYKATVTKTAWTRKIYSAK 
KRKVKISVEPVYSGVTLTTAIQLVPLLC 
TAL 


312 


B 


1 


889 


MDYEKADKRPTPWEAAAKSPLGLVD 

DAFQPKMQESrVANVVSAARRKVLPG 

PPEDWNERLSYIPQTQKAYMGSCGRQ 

EYNVTANNNMSTTSQYGSQLPYAYYR 

QASRNDSAIMSMETRHLYTRQLYCYS 
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SEQ 

ID 

NO: 


M 

e 

t 

h 

o 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 

dtlu lUlUUC 

of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=PhenyIalanine, G=Glycine, 
HNHistidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
iv— /vrgimne, a— oenne, i— inreonme, 
V^Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 










FGDSGNFCENTNGRPAADAVRGLTILS 

LSTTSIPSSGISEALISENENKNLEHLTH 

OnWPQTTT HTPP ATlf Trv^TCEXT Am/ 
vjvj I vjz,l>i l i-r^XJKxA i 1 V(/d JirrJLAJr V 

KTEVPLAEN QRSGPDCAGSLKEETGPS 
YQPvAPQMPDSQRGRVAEELILREKVE 
ASTQNNYYVGELTGVTLQNGYGEKPI 
LATQX 


313 


B 


471 


1448 


MLKYTGAHQEVELSAPIVTKMATQYL 

RENLFGRFDNDNFCLLNGDAVIFRMY 

VSWKLVEKERTEIMLKYTGAHQETWL 

KDLEESPLYEALSMRGQDKETLGLWI 

QLPWCPWGKAVQMHMNPSSFQLDTK 

PGKGELAGRLIIPHQEASILELSLLLMT 

CCVEREGKTSVRVAAVGECTASETPN 

QGAGRLSLWQQLTSKKETIMEKEHTD 

^ v av^ l V AL,lb 1 t_ VKJMjubKr ADKDLEE 

GGGLEAESPKQSPNLCVILRHNLASRP 

GQLALVTVGTMQGRPLSHSSEVKGTT 

FVTHSVPAGKEKDEERGIGDLEHARDL 

RNSPTPLFY 


314 


A 


1 


903 


MSELPFTIASKRIKYLGIQLTRDVKDFF 

KENYKPLLNEIKEETNKWKNIPCSWV 

GPJNIVKMAILPQRLPHGFLPNMKLEV 

VDKRNPRLIRVAHVDVDDQRVKHSM 

TASSGSGVSADLNTASQPLWLLKTAL 

AVSSSVKVHPPVSGLIFSSSRTLLSFMG 

IMREDLGFSRRQILHFPMALSKSAGRR 

PNPENFSWTEYLEATQTNAVPAKVFK 

MDSDVGENRKILRDERPNYSQYTPFSR 

CDNASYKENVFLQKLERNTPDIAERFD 
CT T T TY 


315 


A 


12 


253 


MMSWSSAEKGPEGHRRREWPVSQWED 
EP*NQNGESRKKERKEKRRKET* EERR 

GEKREKREKKRRREGRMSISFARRYSI 
L 


316 


A 


2020 


3942 


SQRTAGNPCLHPVSLCGSASWMPMIM 
PQRWSSLCSAMEKPASPCL*MPPQATC 
WCPSRLPMAWA\SGH*HTSTGHSQLPA 
IPFDNHCGKRCRLGGKWRAPLQHPQW 
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SEQ 

ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, 
EHHistidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
R=Areinine % S=Serine. T=Threonine 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, Y=possibIe nucleotide 
insertion 










K 


317 


A 


2 


271 


RRPTRPQEEGGSESSTMTELETAMGLII 
DVFSRYSGSEGSTQTLTKGELKVLMEK 
ELPGFLQLSGPGLGHQHTLLLLFRSAS 
WSRLVPQ 


318 


A 


1 


455 


MVEGKEEQVTSYVDVQRACAGIRGAF 
EKPOGAVARVHIGOVIMSICTKLONKF 
HVIEALCKANFKFPGRQNIHFSEKWDF 
TKESVDEFEDMMAEKQLIP/DNCGVKY 
TPNRDPPDKRDGVALQHGLLLWQLLQ 
NKIRLHQGREKKPPKKARR 


319 


B 


1 


370 


MSRRKOGKPOHLSKREFSPRDRFFVTT 
CFPCPPPTPPGLVTSPPAPRARLGQPCS 1 
ARNENLLEADYDPPEPIVLRNTTATHT 
HSHSVSPSLYNSDSPQPLKHLGAVSAA 
ETGVRGMMGMYLKPX 


320 


B 


1 


3204 


MCELDILHDSLYQFCPELHLKRLNSLT 

LACHALLDCKTLTLTELGRNLPTKART 

KHNIKRIDRLLGNRHLHKERLAVYRW 

HASFICSGNTMPIVLVDWSDIREQKRL 

MVLRASVALHGRSVTLYEKAFPLSEQ 

CSKKAHDQFLADLASILPSNTTPLIVSD 

AGFKVPWYKSVEKLGWYWLSRVRGK 

VQYADLGAENWKPISNLHDMSSSHSK 

TLGYKRLTKSNPISCQILLYKSRSKGRK 

NQRSTRTHCHHPSPKIYSASAKEPWIL 

ATNLPVEIRTPKQLVNIYSKRMQIEETF 

RDLKSPAYGLGLRHSRTSSSERFDIML 

LIALMLQLTCWLAGVHAQKQGWDKH 

FQANTVRNRNLKJYSHMVTLWGNYEG 

ISQTQAFAKENNQKAYKETYGVSHITR 

HDMLQIPKQQQNEKYQVPQFDQSTIK 

NIESAKGLDVWDSWPLQNADGTVAEY 

NGYHWFALAGSPKDADDTSIYMFYQ 

KVGDNSIDSWKNAGRVFKDSDKFDAN 

DPILKDQTQEWSGSATFTSDGKIRLFY 

TDYSGKHYGKQSLTTAQKAYRLEIVSL 

EMQKNGAADAAPYRQIEYWALGHGD 

DIKKAVAFWSSGWPVGFSKMEKAGKI 

LRSQVKFPEYMEESSCLGRGSLMSLNN 
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SEQ 

ID 

NO: 


M 

e 

t 

h 

o 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=GIutamic Acid, 
F=Phenylalanine, 0=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
R=Arpinin< k S=Sprinp T=TTirpnnino 

V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 










TSSSNGSFIFVLPLKLLRVGDTYNSSDQ 

SRMAWRLTIEFGGSELHLGVREEAGH 

QKGLVHESGNPARSSGSDPQHARHRQ | 

PSATRAAAAAAAAAAALPAPLSLPVPT 

SAIQVRVTAYPLLAQCLQAAFPPLLGS 

GCGQEGTGAGGGGGAAGVREQLEDR 

RAAAEPGDLPGGKRVRGRGAREGPGV 

GAEGPPLERNRPSSPLPWLAAPAAGAS 

QFAEIQGAGKGEMRAKDAERGRAKLR 

GELSSSGRKIFDPDDLYSGVNFSKVLST 

LLAVNKATEDQLSERPCGRSSSLSAAN 

TSQTNPQGAVSSTVSGLQRQSKTVEM 
TFNTCr^iT-IOT TVlf APFWFlf OTMPnirr <z\rr< 

X JZJN KJOn\£Lil V iVrvEvT INT JVV^ 1 IN CUlZLjti V 

KGDIIYVTRVEEGGWWEGTLNGRTGW 
FPSNYVREIKSSERPLSPKAVKGFETAP 
LTKNYYTVMSRSLTSTVLKNSKVARIH 
SKPY 


321 


A 


724 


1296 


RSPTLSSPPPASKAQALALRSEAQAQM 
PRLPAPRVRRSSAAASAAARSLAETFS 
GKECQWTDACLSHPCANGSTCTTVAN 

CQHGGTCLNLPGSYQCQCLQGFTGQY 
CDSLYVPCAPSPCVNGGTCRQTGDFTF 
ECNCLPETVRRGTFI WFRT")RFVWNfil<r 

**-J*s± y V^JL-f A A-/ a V XVXVVJ 1 1—tl—i VV X^XVX^/XXJL* V VV i N VJX\. 

EPDEN 


322 


C 


150 


362 


MPGPAAASHRASTYVSTWSCPPHHSW 
HAWOCTVARPHLOTSHCCTSGLPT AD 
MESRLVASPSEWNKLTWAQ 


323 


A 


1133 


1350 


ESQSLETGLRALIWSTRKPGGPVLGGL 
VLIKWAWASRSPASPSDPSPGPNLCCS 
PTSPATKPRVDGPFVIRN 


324 


A 


1 


615 


MNWVLQKFITAWKFMGYRKSSNSAR 
GSTIKEHIELDAQRPVRRSGPIQASGAH 
PKKGRGVSCSVEEPSDQQSPSPPSPLTF 
OPKDGEINFSVIGOYVDYT VKFOfiVK" 

NIFGKSTLGMSLHVSSSVFRRYILPGYQ 
PRGHTVMVSQVNIDFQTREATRKNLQ 
EPSLTCFDQAQGKVHSLMEKDSYPRFL 
RSKMYLDLLSQSQRRLS 


325 


B 


1 


669 


MVMSFVKPGVKEKEQVKKRDGEFNSE 
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SEQ 
ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
acid residue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=PhenylaIanine, G=Glycine, 
H=Histidine, I=IsoIeucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
R=Arrfnine. S=Serine T=Thi-pnninA 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 










HAELDWARDTKRKFWEPTRLSSTLRT 

SSDPLFSVPISITMVCEPGSKSLQSCCLT 

AGGANVWEKSTCRKKSRQLVLRNVK 
vpgkspcCtFt t ptt kkivjot >jtt t t r>p\m 

TETLEGPPGLGLDAEGPEKRHSWILLP 
CPGIDHTSGLEVMSDLYHRKGNSLHPQ 
GKRTKDARKESFPQKMGQFPLQSLAVI 
YPEAGT 


326 


B 


1 


2043 


MEEHSMLMGRKNQYPvENGRIAQELEK 

TTLKFIWNQKPvACITKSNLSQKNKAGG 

ITLPDFKLYYKATVTKTAWYWYQNRD 

IDQWNRTEPSEIMPHIYNYLIFDKPEKN 

KQWGKDSLFNKRFWENWLAIFRKLKL 

DPFLTPYTKINSRWIKDLHVPvPKTIKTL 

EENPGITIQDTGMGKDFTSKTPKAMAT 

KAKIDKWDLIKLKSFCTAKETTIRVNR 

QPTKWEKIFATYSSDKGLTSRIYNELK 

QIHKKKTNNPIRXW AKDMNRHF SKED 

IYAAKKHMKKCSPSLAIREMQIKTTMR 

YHLTSVRMAIIQKSGNNRVLPLAPLAL 

AALWMDPVMPGMDGLLGDSESFQGL 

SATFFASVFHSALHIDSAPGPCIGPGDS 

SADSSPTFLPPEAKRKNYLLLWRKNLK 

KFSDDPKRLIEGFPKLALTFRLIWKDIN 

VLLGQALLQEERQTICGAAIHCRNDLH 

LENANYPGGATAVPQLDPNQDYNAK 

AGIWARNHRLLCLIETTTQQPTNAHSP 

QTQRQQHDTDKPQPNPPAKTTGVPVS 

FLAFLYQYLCGHISISWPWILKYAASV 

YGISLADRKROYDRYFRYFRI RTTK"P>J 

FLPFQIFKSGSVVKLKAGFTIGKVHNTE 
VTALKVSDTRRAQHLQTGCWSAWT 
HPNNLENVVRHPPEALAASYNKPFICS 
LVTLQGAFVT 


327 


A 


1 


1113 


MLMVYPRTNKQNQKKKWKVEPPTPQ " 

EPGPAKVAVTTSSSSSSSIPSAEKVPTT 

KSTLWQEEMRTKDQPDGSSLSPAQSPS 

QSQPPAASSLREPGLESKEEESAMSSD 

RMDCGRIPSTPNHRRSQVIEKFEALDIE 

KAEHMETNAVGPSPSSDTRQGRSEKR 
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SEQ 

ID 

NO: 


M 

e 

t 

h 

o 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 

residue of 
amino acid 
sequence 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
aciu residue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=PhenyIalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=GIutamine, 
Jtv— /\rginiiie, o— serine, i— -inreonine, 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible nucleotide 
insertion 










AFPSKQDFTNEAPPQAPLPDASASPVST 
PKSQVTGQEGQNISWDMAWLKATQE 
APAASTLGSYSLPGTLAKSEILETHGT 

1V/TNJPTT FiQQPOATT? VTTDPMT nTAPXTA A r^T 

DRYYPEGPLPRSGDDTVLSPRPAWSKL 
AATRSQRLDPRQRGSTEREKGEAHND 
PLAAAPI S AAGSRRGAL AS WLASPTRS 
QNPARPPTPDCRAPLTESARPTS 




n 


i 
i 


ISA 


MLrJv WLMjtjJr JnJt M V r C V LbJN 1 CoLNKbL 
LENLIFATLKI S WGGAGIKG WNHMKA 
QLLTCVVFDAGFQLSPQGSLEQKHLIL 
YPELTGPIELDPICYHHHLSNLLSQTPK 
ASFEVLHQNH 


329 


A 


1 


768 


PIWATRCYGGWRGSLFLELPRCDGPLS 
RWGWAAPKMAASLLFSDQGFLASWIP 
RNGTLGHAMSGIALLEAVGHGRELWN 
PATSAASVISNTSSDANQFPKLPLGEISP 
DKEKSYWSSNPTKNNPKETTRVAVFA 

PTT TY\/"MTYM A DXJT7 A ^7T?"VTVrTr\7/**»T7Ki A DT)P 

QRDACGRLGDWDSEKQEGRKGFTSTN 
SVILHKKRTKRSVLGGDKFWQKKSPES 

PQGDFQDSRSLRSTVKFHKSTRKLQGE 

VFi<r <5 a P"wvr;pnPT 
vjdjvoo v/vinxn i urUrL 


330 


B 


44 


329 


GCLEHEASSAYEWLWSLCALLDMYTA 
GPTKTQTLQPMGQPNLKGDGGFTRES 
TGFMQLPADFI SSLICHETWVPGKPSTA 
MHRGRYWAEPIMLPKX 


331 


B 


3 


431 


MSEKNTPLVLSGENQKKGREIGVCRK 

LVSFHSHNYPSKVLLQGNLDTETCTER 
RQREIWTQRHEGKCGHRDMYREKTK 
RKYREKAIYKLRKGPETDPSSQPSERT 
NPANTLISDS 


332 


A 


5 


1269 


TISWRQGRGGEAGRRLWHTPPQVKHL 
EPGHP/EQQARKAQIQGQPPAPGWDLT 
OGEOAG/NIHOHCGOSRG*OOHOKDP* 
GHRFAEGMQSLSKELQSD*TSRKGGFR 
VPCSHNREPPTRPGDPCDSNPAGLGLQE 
CRARYRPGKPSS/PPRGQSRAT/GPVRW 
HPSPSRN*G/PPGSRPAPGTDPAPGRPP\ 
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SEQ 

ID 

NO: 


M 
e 
t 
h 

0 

d 


Predicted 
beginning 
nucleotide 
location 
corres- 
ponding to 
first amino 
acid 

residue of 
amino acid 
sequence . 


Predicted 
end 

nucleotide 
location 
corres- 
ponding to 
first amino 
aciu residue 
of amino acid 
sequence 


Amino acid segment containing signal 
peptide (A=Alanine C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=PhenyIalanine, G=Glycine, 
H=Histidine, I=IsoIeueine, K=Lysine, 
L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
K— Argmine, &-*©erine, 1 — 1 nreomne, 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *==Stop codon, /=possible 
nucleotide deletion, \=possible nucleotide 
insertion 










GRPLAASGLLPNSPPAPG/SP/QGPPPPR 
GSNR/PRFPHWLRRPAGRGAPC*PQPRS 
PQ/QHIPEHRTKPVPAPEPPSGSRNTDPP 
GQPRARGTWKASPGHRADSASRRASF 

WFILNGFIGGVIGRRMTDCQACEPRLR 

c'xr\r^r\i dccvtct pttt>a at nrr\c^T>vxr\7T 

ERDQPSACSLKTPAQTCLPQCSLHWTL 

i\U\JK^ I l^Jr JL 1 AJroo 1 J.VUN vjA i KivlivL, 


333 


A 


233 


380 


PLVVCLLEFYCTHLRDGLNSVQLAYR 
GCRPTEATFTPARRPWQARAPCR 


334 


B 


220 


309 


MAEYDLTTRIAHFLDRHLVFPLLEFLS 
VKE 


j j j 


A 


j / / 




IvO/vKlJv lAVLVLVLVLV I Jr ^ V ri V 1 C 

VRMCLCVCVCVCVCVCVCGGCKCTC 
GPTEGGEKAWLFTSIQEGRRCGWSSSL 
RGSAAGPvDLYSAPvLFAHRLLLLEGRP 
WQDAGAPSAARISRSEPWS 
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WHAT IS CLAIMED IS: 
5 1. An isolated polynucleotide comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1-84, or 168-251, a mature protein coding portion of SEQ ID NO: 1- 
84, or 168-251, an active domain coding portion of SEQ ID NO: 1-84, or 168-251, and 
complementary sequences thereof 

10 2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization 
conditions. 

3. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
1 5 polynucleotide has greater than about 90% sequence identity with the polynucleotide of claim 1 . 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
20 complementary sequences. 



6. A vector comprising the polynucleotide of claim 1 . 

7. An expression vector comprising the polynucleotide of claim 1 . 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 . 



25 



9. A host cell genetically engineered to comprise the polynucleotide of claim 1 operatively 
associated with a regulatory sequence that modulates expression of the polynucleotide in the host 

30 cell. 

1 0. An isolated polypeptide, wherein the polypeptide is selected from the group consisting of: 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1 ; 

(b) a polypeptide encoded by a polynucleotide hybridizing under stringent conditions 
35 with any one of SEQ ID NO: 1-84, or 168-251; and 
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(c) a polypeptide comprising an amino acid sequence selected from the group consisting 

of SEQ ID NO: 85-1 67, or 252-335; the mature protein portion thereof, or the active domain 

thereof 

11. A composition comprising the polypeptide of claim 1 0 and a carrier. 

12. An antibody directed against the polypeptide of claim 1 0. 

13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polynucleotide of claim 1 for a period sufficient to form the complex; and 

b) detecting the complex, so that if a complex is detected, the polynucleotide 
of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with 
nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions; 

b) amplifying a product comprising at least a portion of the polynucleotide of 

claim 1; and 

c) detecting said product and thereby the polynucleotide of claim 1 in the 

sample. 

15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the method 
further comprises reverse transcribing an annealed RNA molecule into a cDNA polynucleotide. 

1 6. A method for detecting the polypeptide of claim 1 0 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polypeptide under conditions and for a period sufficient to form the complex; and 

b) detecting formation of the complex, so that if a complex formation is 
detected, the polypeptide of claim 10 is detected. 

1 7. A method for identifying a compound that binds to the polypeptide of claim 1 0, 
comprising: 

a) contacting the compound with the polypeptide of claim 1 0 under 
conditions sufficient to form a polypeptide/compound complex; and 
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b) detecting the complex, so that if the polypeptide/compound complex is 

detected, a compound that binds to the polypeptide of claim 10 is identified. 

1 8. A method for identifying a compound that binds to the polypeptide of claim 1 0, 
5 comprising: 

a) contacting the compound with the polypeptide of claim 1 0, in a cell, under 
conditions sufficient to form a polypeptide/compound complex, wherein the complex drives 
expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence expression, so 
10 that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide 

of claim 10 is identified. 

1 9. A method of producing the polypeptide of claim 1 0, comprising: 

a) culturing a host cell comprising a polynucleotide sequence selected from 
1 5 the group consisting of a polynucleotide sequence of SEQ ID NO: 1 -84, or 1 68-25 1 , a mature 
protein coding portion of SEQ ID NO: 1-84, or 168-251, an active domain of SEQ ID NO: 1-84, 
or 168-251, complementary sequences thereof and a polynucleotide sequence hybridizing under 
stringent conditions to SEQ ID NO: 1-84, or 168-251, under conditions sufficient to express the 
polypeptide in said cell; and 
20 b) isolating the polypeptide from the cell culture or cells of step (a). 

20. The isolated polypeptide of claim 10 comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 85-1 67, or 252-335, the mature protein portion thereof, or 
the active domain thereof. 

25 

21 . The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide array. 

22. A collection of polynucleotides, wherein the collection comprising the sequence 
information of at least one of SEQ ID NO: 1-84, or 168-251. 

30 

23. The collection of claim 22, wherein the collection is provided on a nucleic acid array. 

24. The collection of claim 23, wherein the array detects full-matches to any one of the 
polynucleotides in the collection. 

35 
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25. The collection of claim 23, wherein the array detects mismatches to any one of the 



polynucleotides in the collection. 

26. The collection of claim 22, wherein the collection is provided in a computer-readable 
format. 

27. A method of treatment comprising administering to a mammalian subject in need thereof 
a therapeutic amount of a composition comprising a polypeptide of claim 1 0 or 20 and a 
pharmaceutically acceptable carrier. 

28. A method of treatment comprising administering to a mammalian subject in need thereof 
a therapeutic amount of a composition comprising an antibody that specifically binds to a 
polypeptide of claim 10 or 20 and a pharmaceutically acceptable carrier. 

29. A method of detecting bone marrow cells or tissues in a sample comprising: 

a) contacting the sample with a compound that binds to and forms a complex with 
the polynucleotide of claim 1 for a period sufficient to form a complex; and 

b) detecting the complex, so that if a complex is detected, the polynucleotide of 
claim 1 is detected 

wherein the presence of the polynucleotide of claim 1 indicates the presence of bone marrow 
cells or tissues. 

30. A method for detecting bone marrow cells or tissue in a sample comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polypeptide under conditions and for a period sufficient to form a complex; and 

b) detecting formation of the complex so that if a complex is detected, the 
polypeptide of claim 10 is detected, 

wherein the presence of the polypeptide of claim 10 indicates the presence of bone marrow cells 
or tissues in a sample. 



i 
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<110> Hyseq, Inc 

<120> NOVEL BONE MARROW NUCLEIC ACIDS AND POLYPEPTIDES 

<130> 21272-050 

<140> NOT YET ASSIGNED 
<141> 2001-03-30 

<150> 60/250,583 
<151> 2000-11-30 

<150> 09/540,217 
<151> 2000-03-31 

<150> 09/649,167 
<151> 2000-08-23 

<160> 335 

<170> Custom 

<210> 1 

<211> 516 

<212> DNA 

<213> Homo sapiens 



<400> 1 

ggaggcagag ctctgggaat ctcaccatgg cctggacccc tctcctgctc cccctcctca 60 

ctttctgcac agtctctgag gcctcctatg agttgacaca gccaccctcg gtgtcagtgt 120 

ccccaggaca aacggccacg atcacctgct ctggagatgc attgccaaaa aaaacatcct 180 

tattggtacc agcagaagtc aggccaggcc cctgtactgg tcatctatga ggacaacaaa 240 

cgaccctccg ggatccctga gagattctct gggtccaact tagggataat ggccaccttg 300 

attatcagtg gggcccaggt ggaggatgaa gctgactact actgttattc aacagacagc 360 

agtggtaatc atagggaggt gttcggcgga gggaccaggc tgaccgtcct aggtcagccc 420 

aaggctgccc ccttcggtca ctctgttccc gccctcctct gaggagcttc aagccaacaa 480 

ggccacactg gtatgtctca taagtgactt ctaccc 516 



<210> 2 

<211> 458 

<212> DNA 

<213> Homo sapiens 



<400> 2 

tcaagacaaa gctctccttt gtaaaccaga ccctttccat tcagtctatc acagggtggc 60 

ttactcggca ccccttttca gccccgctct cctcttcagt tctcactggg gcttttttgt 120 

tcttaattcc ttttcatggc ccggcaaaaa cggagttaat tatattaaag acctgacttc 180 

cctgtctagc tccttaactc caggtcagca gataattgag agtcattgcc ctgatactga 240 

atgaagagat aaagttccca ggtttatttc aagtgactta tctgaagatg aggaaagagc 300 

aagaggttac taaaaaacat atctgtgaat tgttgacaga gacggtcact tctgcagaaa 360 

ctccagatgc ccttgccaag tccaggtaca ggtctaaact agcaaaccaa atgcattttc 420 

taggtttaaa gtacaggtgg gtgctcatct tcaggatg 458 



<210> 3 
<211> 922 
<212> DNA 
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<213> Homo sapiens 



<400> 3 

tttttttttt ttgacaattg ttatttagtt tttatttgca taatcataaa cttaactctg 60 

caaatccagc taggcatggg agggaacaaa ggaaaacatg gaacccaaaa gggaactgca 120 

agcgagagcc acaaaagatt cfcaggattac tccgagccaa atggggtgga aggggtgctc 180 

tccttgagct acagaaggaa tgatctggtg gttaagataa accacaaagt caaacttatt 240 

tcgagttgtc cacagtcagc attggtgatc tccttgctgg tcttgccatt cctgggaccc 300 

aaagccgctc catggcctcc acaatattca tgccttcttt cactttgccg aaacaccaca 360 

tgctttgcca tccaaccact ccagtcttgg gcagtgcaga tggaaaaact ggggaaccat 420 

ttgtgtttgg gtccagcatt tgccatggga caagatgcca ggacccgtat gctttaggat 480 

gaagtttctc atccttcaaa tttctcccca tagatggact tgccaccagt gccattatgg 540 

cgtgtgaagt caccaccctg acacataaac cctggaataa ttctgtgaaa gcaggaaccc 600 

ttataaccaa atcctttctc tccagtgctc agagcacgaa aattttctgc tgtctttggg 660 

accttgtctg. caaacagaag gcaaaattct aaggaacagc aagcactgct gcacgatcag 720 

gggtaaacag ctgttctttt agtcacttgt aatgaaaatg ctactacctc caatctgtta 780 

acatagcttt atggattaga tgagcagtca cttcttggta tccaggcccc ttacctcaaa 840 

ggagacgcgg cccaagggct cgccgtcgac ggcaatgtcg aagaacacgg tggggttgac 900 

catggctaat agtacacggt tt 922 



<210> 4 

<211> 1185 

<212> DNA 

<213> Homo sapiens 



<400> 4 

gcgcctgtgg ggaggccgcc tgtgcctcgg tacatggtgc caaccgcctc ggggcaaact 60 

cgctcttgga cctggttgtc tttggtcggg catgtgcpct gagcatcgaa gagtcatgca 120 

ggcctggaga gtgaagtccc tccaattaaa ccaaacgctg gggaagaatc tgtcatgaat 180 

cttgacaaat tgagatttgc tgatggaaga agcataagaa catcgggaac tgcgactcag 240 

catgcacgaa gtcaatgcaa aatcatgctg ccgtgttccg tgtgggaagc gtgttgcaag 300 

aaagttgcgg gaaaatcagc aagctctatg gagacctaaa gcacctgaag acgttcgacc 360 

ggggaatggg ctggaacaca gacctggtgg agaccctgga gctgcagaac ctgatgctgt 420 

gtgcgcttca agaccatcta cggagcagag gcacggaagg agtcacgggg cgcgcacgcc 480 

ggggaaaact acaaggggcg ccttctcgcc acgcccacct gcacctgcct tttcctcccg 540 

cctggtggga ctcaacccca cccctgcatt ttctctgcat tttatgtcgt ttccccaaaa 600 

gtatatccaa aaaatgcctt ttttccccct ggtaactttg atccctgggt tctcgccatt 660 

ttctggatca ctgtgacctg ttccttgctt tgggtcagca tccactgatg ccagcagtgg 720 

catctccaag ccaatgtgct ttgctgttag aaggccaagg ttagaagtgc agccagcgtg 780 

gcatgaccag gaaataaatg ccagtttatt aaataacgag taagccaccg tttcaagcct 840 

gccctgtgga ggaaatgcca gtttattaaa taacgagtaa gccaccattt caagcctgcc 900 

ctgtggagga aatgccagtt tattaaataa caagtaagtc accgtttcag acctgccctg 960 

tggaggaaat gccagtttac taaataacga gtaagccacc gtttcaagcc tgccctgtgg 1020 

aggaaatgcc agttcattaa ataacaagta ttatagagcc tgtcctgtgg tttgattacg 1080 

gagcctgccc tgtggtcact tgttcttcag atgaactgat ttttgtgcaa agcacacgtg 1140 

ttggattctg cctggtgcag attgatgagt atgatcactc caagc 1185 



<210> 5 
<211> 416 
<212> DNA 

<213> Homo sapiens 
<220> 

<22l> misc_feature 
<222> (l) . . . (416) 
<223> n = a,t,c or g 



2 



WO 01/74836 



PCT/US01/10472 



<400> 5 

gttagagatt acaatttgac tgaagaacag aaggcgatca aggccaagta tccgccagtc 60 

aataggaagt acgagtattt ggatcataca gcagatgtcc atggattgta ttacacagag 120 

catgaatata cttctttagg ttacacgcat ggggagatac tctggaggaa gcatttgagc 180 

aatgtgcaat ggccatgttt ggttacatga cagatactgg gacagtggag cccctccnnn 240 

nnnnnnnnnn nnimnnncaa ggtggggaga agaattttca ttgtccaagc accctcaggg 3 00 

aacagaagtc aaagcaataa catattcagc aatgcaggtc tataatgaag agaacccgga 360 

agtttttgtg atcattgaca tttaagacac ccaaaaatan aagactccta cgaaaa 416 



<210> 6 

<211> 787 

<212> DNA 

<213> Homo sapiens 



<400> 6 

aattcgaggg tgtaaagaag aggaagggga 
aagcccccgg cctgagaaag aaggaggagt 
tgtgtacgct gaggggggag ggctcttctt 
ccttccctcg cccaactctc cctcggggcc 
cccaaaatgg agaacgcgga gcagcgcaag 
gatttaggtt ccttttctcc ggcaggctcc 
ggaacaacat tcatgtgacg ggcccctctg 
tggtggggga acctgctggg gaaggcgatt 
cgacttcgga attgtcttac aagaaagaag 
atgaatatat ttctgacatt gaggtgttcc 
agaaacacca actgatttta atctactaca 
agattttgaa cgatatcgca ggaaaaaacc 
aaggcgaaga gggcccagca gaaaaggtct 
ctgcgcg 



aaaatatctt 


gtaccagccc 


aggggtgaag 


60 


gggggaggcg 


aacagtctcg 


ttgctgcctc 


i20 


ttcgactacc 


ctcgctgaag 


attttctctt 


180 


tctacggaga 


cccccccccc 


ccatgggtgg 


240 


atggaggctc 


cgcttctctg 


tgtagcagta 


300 


taagcctagg 


agagcccgaa 


ctggagagag 


360 


cttctccccc 


gccccatcat 


ctccagcgcg 


420 


ggcatcgatc 


tctccatccc 


ttccgaggcc 


480 


gcatgccacc 


attttacccc 


acgtaaatat 


540 


agatgatgat 


aaagaaatga 


tagctgctcc 


600 


ggagtcagaa 


tcacattttt 


cttctgacac 


660 


gaactcaggg 


caaaggcaat 


acttgtagaa 


720 


gaggtggaaa 


aggaagagga 


acacctactt 


780 
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<210> 7 

<211> 10918 

<212> DNA 

<213> Homo sapiens 



<400> 7 

cgctgagtac cgtccggatt ccctggtcga ctattgctgt gcgccattga ggagcggctg 60 

tgaggatgcg cacgccgcaa ggtctagact ggacgcgagt ctaccgagac accgaccgac 120 

tgttctgtca ccatgaaaga tacagataca gaatgatcct tgacgtctaa acaccttttc 180 

ctgaggatat agtcactgtt ggaaggcctt gtaacctggg ttctcaatac ctatttggga 240 

aaatatgtca ataacctgaa cactgaccag ctctcagttg cacttctcac aggtgctgtt 3 00 

gaattacaaa acttgccatt aaagaaagat gccttgaaag aattggaatt accatttgaa 360 

gtcaaagctg gcttcattgg gaaagtaacc cttcagattc ccttttatcg cccccatgtg 420 

gacccttggg tgatctccat ctccagcctt cacttaattg gagccccaga gaaaatacag 480 

gatttcaatg atgaaaagga gaagctgttg gaaagggaac gtaagaaagc actacttcaa 540 

gccctggagg agaaatggaa gaatgaccgc cagcagaaag gggagtccta ttggtattca 600 

gttaccgcct ccgtagctac aaggattgtg gagaatattg aattaaaaat tcaagatgtc 660 

catttacgct ttgaagatgg cgtcaccaat ccctcccatc cttttgcttt tggcatctgc 720 

attaagaatg tgtccatgca aaatgctgtg aatgagcctg tacagaaact aatgcggaaa 780 

aagcaattag acgtagcaga atttagcatc tattgtgatg tcgattgcac tttactgggg 840 

gatttgcctc agatggagtt acaggaggcc atggccagga gcatggagag tcgcagccat 900 

cactacgtcc tggagcctgt gtttgcatct gctcttttga agagaaactg ctccaagaag 960 

cccctgcggt ctcggcacag tccccgtatt gattgtgata ttcagctgga gaccattccc 1020 

ttgaaactct ctcagctgca ataccggcaa atcatggaat tcctcaagga gctggaacga 1080 

aaggagaggc aggtgaagtt ccgaaggtgg aaacccaagg tggcgatatc taagaactgc 1140 

cgagaatggt ggtattttgc tttgaatgct aacttgtatg agatcagaga gcagaggaaa 1200 

cgttgcacct gggactttat gttgcaccgc gctcgtgatg ctgtatctta cactgacaaa 1260 

tatttcaaca agttaaaagg aggcctgctg tccacagatg acaaggagga aatgtgtcgg 1320 
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attgaagagg aacagagctt tgaggaattg aagattttgc gtgaactggt tcatgatcga 1380 

tttcacaaac aggaagaact agcagagagt ctgcgggagc ctcagtttga ttctccagga 1440 

gcctgtccgg gagccccaga acccggtgga ggcagtggga tgctgcagta tctccagtcc 1500 

tggtttcctg gatggggtgg ctggtacggg cagcagaccc cagaagggaa tgtggttgag 1560 

ggactgtcag cagagcaaca ggagcagtgg attcctgaag agatcctggg cactgaggag 1620 

ttttttgacc ccactgcaga tgcctcgtgt atgaacacgt atacaaagcg agatcafcgtc 1680 

tttgccaaac tgaatttgca gttgcagcga ggtacagtga ctctgttaca caaggagcaa 1740 

ggaactcctc aaatgaatga aagtgctttc atgcagctcg agttttcaga tgtaaaactt 1800 

ctagcagagt ctcttcctcg aagaaattcc tcgttgcttt cagtccggtt gggtggactg 1860 

tttcttcgag acctggctac agaaggaact atgtttcctc ttctagtctt ccctaatcca 1920 

caaaaagaag ttggcagagt ctca.ca.atct tttggtctac aaactacatc tgcagacaga 1980 

agtgatcatt acccagctgc agatccagat ggccccgttt ttgagatgct gtatgagaga 2040 

aatccggcgc acagccactt tgagaggcgg ctcaatgtca gcacaaggcc cttgaacatc 2100 

atatacaatc cgcaggccat taaaaaagta gcagactttt tctacaaggg aaaggttcat 2160 

acctcaggtt ttggttatca gtctgaactt gagctgagag tggctgaagc tgcccgaaga 2220 

caatataaca agctgaagat gcagaccaag gcagaaatcc ggcaaactct tgatcgtttg 2280 

ctagtgggtg atttcattga ggagagtaaa cgatggaccg tgcggctgga tatttctgcc 2340 

cctcaggtga tatttcctga tgatttcaaa ttcaagaatc ctgtgttagt tgtcgtggat 2400 

ctaggaagaa tgcttttgac gaacacccaa gataactcca ggaggaaaag tagggatggg 2460 

tcagcatctg aagagaccca gfcttagtgat gatgaatata agacccccct ggccacacct 2520 

cctaacaccc cacctcccga gtcaagcagc agcaacggag agaaaacacc tcccttttct 2580 

ggagttgagt tcagtgaaga acagcttcaa gcacatttaa tgagcacaaa gatgtatgag 2640 

aggtactcgc tgtcatttat ggacctccag atcatggttg gacgagtgaa agacaattgg 2700 

aagcatgtcc aggatattga cgtgggacca acacatgtgg tagagaagtt caacgttcac 2760 

ctacagttag agcgtcgatt gatttatact tcagatccca aatatccagg agccgtgctc 282 0 

tcaggcaact taccagactt aaaaatccac attaatgaag ataaaatatc tgcactaaag 2880 

aattgctttg ctctcctcac caccccagaa atgaaaactt ctgacactca gattaaagaa 2940 

aagatttttc cccaggagga gcagcgggga agtttgcaag actccgtaat gaatttaacc 3000 

cagagcattg tgttgttgga gcagcatacc cgcgaggttc tggtggagtc gcagctcctc 3060 

ctggcggaat ttaaagtgaa ctgtatgcag cttggtgttg agagcaatgg ccggtacatt 3120 

tctgtgctca aggtgtttgg taccaatgct cactttgtga agaggcctta tgatgctgaa 3180 

gtctccctaa ctgttcatgg tttgctcctg gtggatacca tgcagacata tggtgctgat 3240 

tttgaccttt tgatggcttc acataagaac ttgagctttg atattccaac gggaagcctt 3300 

cgggatagca gggcccagtc tcctgtctct ggaccgaatg tggcccactt aactgatgga 3360 

gctacactga acgaccgatc agctacgtga ggtgtctcac ttggcaggat tcgttccaaa 3420 

gagcgagagt cccttagtaa gttggaatat cagtgtggga gttcagagcg cccatcgatg 3480 

aatttacaca gtactcttca ggtgatttcc ctgcaggtga ataatttaga tattatcgtc 3540 

attccagaga cgagtgtgga gctaagtggt tttcttcaaa aatcctttcc caaggaaaaa 3600 

gatgatctaa gtcctcaacc tttaatgact gtttgtgaaa gaagcttcag agaacaagga 3660 

tcgttccagt ctacatatga acgaatcact gaggttgcag tggaaatcca tagaatgaac 3720 

ttagtgcttc ttcggacagt gggcatggca aatagagaga aatatggcag aaaaattgca 3780 

actgcaagta taggtggcac caaagttaat gtctcaatgg gtagcacgtt tgacatgaat 3840 

ggttctcttg gctgtttaca gcttatggat ttgacacaag ataacgttaa aaaccagtat 3900 

gttgtcagca ttgggaattc tgtaggctat gaaaatatca tcagtgatat tggctacttt 3960 

gaatctgtgt ttgtcagaat ggaagatgca gccctcactg aagctttgag tttcacgttt 4020 

gttgagagat ctaaacagga gtgttttctc aacctgaaga tggcttcttt acattataac 4080. 

cactctgcta agtttttgaa ggagttgacg ttatccatgg atgaactgga agaaaatttt 4140- 

cgaggtatgc tgaaaagcgc agccaccaaa gtcaccacag tactagctac caagactgcc 4200 

gagtatagcg agatggtatc gctctttgaa actccaagga agactcggga accctttatc 4260 

ttagaggaaa atgaaatata tgggtttgac ctagcttcgt ctcatttgga cactgtaaag 4320 

ctaatcttga acataaacat tgaatcacca gttgtttcta tccctcggaa gccggggagt 4380 

cctgagttgt tggtgggaca cttgggacag atattcatcc agaattttgt ggcgggagat 4440 

gatgaatcca gaagtgaccg tctgcaggtg gaaatcaagg acattaaact gtattctttg 4500 

aattgcaccc agttggcagg tagagaagct gttgggtctg aaggaagccg gatgttttgc 4560 

ccaccttccg ggtctggcag tgccaacagt caggaggaag ctcatttcac acgacatgat 462 0 

ttctttgaat ctttgcatag aggtcaagct tttcacatcc tgaacaacac caccattcag 4680 

tttaaactgg agaagatccc tatagagaga gaatctgaat tgactttttc tcttagccca 4740 

gatgacctgg gaacttctag catcatgaag attgaaggaa aatttgtcaa tccagttcag 4800 

gtggtgttag caaagcatgt atatgagcag gttttacaaa ccctggacaa tctcgtgtac 4860 

agtgaagatc tgaataagta . tccagccagt gctacctcct ccccttgccc tgattctcct 4920 

ctgcctcccc tcagtacctg tggagaatct tctgttgaaa ggaaggagaa tggattgttc 4980 

agccactcca gcctttctaa cacctctcag aagtcattgt cagtgaagga agtcaaatcc 5040 

tttactcaga ttcaagccac cttttgtata tcagagcttc aggttcagct aagtggagat 5100 

ctgactttgg gggcccaagg tcttgtgagc ttaaagtttc aggactttga ggtggaattc 5160 
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agtaaagacc atccccagac tttatctatt cagattgccc tgcattctct gctgatggag 5220 

gacttattgg agaagaatcc agattctaaa tataagaacc tgatggtgtc tcgaggagcc 5280 

cctaagccat ctagtttagc acaaaaagaa tacctttctc agtcttgccc ctcagtgtcc 5340 

aatgtggaat atcctgatat gcctcggtct ctcccttccc acatggaaga agctcctaat 5400 

gtcttccagt tgtatcaaag gcccacctct gcgtcccgga aaaagcaaaa ggaagtccaa 5460 

gacaaggact atcccttgac cccacctcct tctccaacag tggatgagcc caagatactt 5520 

gttggaaaga gtaaatttga tgattcctta gtccacatca acatattctt ggtagataag 5580 

aaacatccag aattctcttc cagttacaat cgagttaacc ggagcattga tgttgatttt 5640 

aattgcttgg atgtgctgat cacactgcaa acctgggttg tgatattaga cttttttgga 5700 

atcggctcca ctgcagacaa ccacgcaatg aggctgcctc ctgagggcat tctgcacaac 5760 

gtgaagttgg agccacatgc ctccatggag tctggacttc aggatccagt gaacaccaaa 5820 

ctggatctca aggttcattc actttctcta gtgctgaata agaccaccag tgagcttgcc 5880 

aaagcaaatg tgtccaaatt agtagcacac ctggaaatga ttgagggaga cctggcctta 5940 

cagggcagca ttgggagtct gtctctaagt gacctcacat gccatggaga gttctacaga 6000 

gaacggttca ctaccagtgg tgaagaagca ctcatcttcc agacttttaa atacggacgg 6060 

cctgaccctc tgctccggag agaacacgac attcgcgtga gcctccggat ggcctctgtg 6120 

cagtatgtgc atactcagcg tttccaggca gaggtggtgg ccttcattca gcatttcact 6180 

cagctgcagg atgtcttagg gcgccagcga gctgctattg aggggcagac ggtgagagat 6240 

caagcccagc gctgttcacg ggttctcctg gatattgagg ctggtgctcc cgttctcttg 6300 

atcccagaaa gttccagatc aaataatctg attgtagcaa atttggggaa gttgaaagtc 6360 

aaaaataagt ttctgtttgc tggttttcct ggcacctttt ccctacaaga taaggaatct 6420 

gtgccttcag cttccccaac gggtattccc aaacacagtc tgaggaaaac gacaagcacg 6480 

gaggagccca ggggaaccca ttcccagggg cagttcacga tgcctcttgc tggaatgagc 6540 

ctaggaagcc tgaagagtga gtttgtgccc agtacctcca ccaagcagca agggccgcaa 6600 

cccacactgt ctgttggcca agagtccagt agtccagaag accatgtctg cctgctggat 6660 

tgcgttgtcg tggatctcca ggaccatgga ccatctttgc tgcggagaga catccggaga 6720 

gattactcga agggcaccag aggatagtag tggagatctg atcttccctt cctattttgt 6780 

gcggacagac aggaggaagc ctccttaacc ggaccttgta ggctgaaatt gcaggtggaa 6840 

aggaatttgg acaaagaaat aagtcatact gtgccagaca tatctatcca tggcaatctc 6900 

tcctcagtcc actgctctct ggatctgtat aaatacaagc tgatccgcgg cttattagag 6960 

aacaacctgg gagaacccat agaggaattt atgcggcctt atgatttaca aagatccaag 7020 

aattcatact gtcctgagtg gagaagtgta cacctgtatg tgcttcctca ttgatatggt 7080 

gaatgtaagt ctggagctta aagatccaaa aagaaaagaa ggtgctgggt ccctagccag 7140 

atttgacttc aagaaatgca aactgctcta tgaaagtttt tccaaccaaa ccaagtccat 7200 

taacttggtt tcccattcca tgatggcttt tgacacccgt tatgctgggc agaagaccag 7260 

ccctggcatg acgaatgtgt tcagctgtat ctttcagccc gctaagaaca gcagcaccac 7320 

ccaagggtcc attcagattg aactacattt cagatctacc aaggattcct cctgctttac 7380 

agtagttctc aacaatctcc gtgtgtttct catatttgac tggctactgt tagtccatga 7440 

ttttctccac actcccagtg atattaagaa acaaaatcat gttactcctt ctcgccaccg 7500 

taactctagc agcgaatctg ctatagttcc caaaactgtg aagagtggag 'tagttaccaa 7560 

gcggtcttcc cttcctgtgt ccaatgaaag gcacctggag gtcaaggtca atgtaacagg 7620 

tacggaattt gtggtcattg aagatgtgtc ctgcttcgac accaatgcca ttattctgaa 7680 

aggcaccaca gtgctcacct ataagccccg gtttgttgat cgcccctttt caggaagttt 7740 

gtttggcatt gaggtgtttt catgccgact agggaatgag catgatacag ctctttcaat 7800 

tgtggatccc gtacaaatfcc aaatggagtt ggtggggaat tcttcttatc aaaatagttc 7860 

aggattgatg gatgcattca atagtgaaga tttcccacct gtcctggaga ttcagttaca 792 0 

agccctggat atcagactct cctataatgg tgttcagctg tttcttgcca ttgcaaaatc 7980 

catcccagag caagctaatg ctgcagtgcc agactcagtg gccctggagt cagactccgt 8040 

tggcacttac cttccaggtg catctcgcgt tggagaggaa atcagagaag ggacaagaca 8100 

caccttagat cctgtcttgg agttacagct ggctaggctg caggagctgg gattcagcat 8160 

ggatgattgt cgcaaagctc ttttggcgtg tcaaggccaa ttgaaaaagg cagcaagttg 8220 

gttgtttaag aatgcggaac ctctgaagtc tctttccttg gcctccacca gccgagatag 8280 

cccaggggct gtggcagcgc cattgatctc tggcgtggag atcaaagctg agagtgtgtg 8340 

catctgtttc atcgatgact gcatggattg tgatgttcct ctcgctgaac tcaccttttc 8400 

ccgtctgaat tttcttcagc gtgtaagaac tagccctgaa ggctatgccc acttcacccfc 8460 

ttctggagat tattataacc gtgctctttc aggctgggag ccatttattg agccttggcc 852 0 

atgctctgta tcctggcaac agcaggcagc tggtcgtctc catcctcctc gactgaagct 8580 

agaagccaag gccaaacctc gtttggatat caatatcact tctgtgctaa ttgaccagta 8640 

tgtaagtacc aaggaatcgt ggatggcaga ttactgtaaa gatgacaagg acatagagtc 8700 

agctaaatca gaagactgga tgggctcttc ggtggatcct ccatgttttg gacaaagtaa 8760 

gagttttcac tacatgtgtt taactaaatc acttttgacc tacaaagcaa tgcttcactg 8820 

agaaattcaa atggaaccat cattctgcgt ggaaaaacct acaggaaatt ctgactatct 8880 

tctgtagttc tgttttagtc tttgcaggag acagttttct ggcatgaact actgaactta 8940 

taaatgtggg ctgtaattcc acctttgcgg gggagaaaag atctgcaaca cgtaattcct 9000 
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aaatcctggt aataatgcat atgaactggt gcacctgaga atgcatcagg atcagctgtg 9060 

gtttgtgttt tactctgcaa aacattggtt ttctaattga tcggattttt ttggccagat 9120 

ttttaaaagc atctaaaatg gtcttctaaa tctagagata gagggtttgg acatattctt 9180 

gaaaagaggg cgagcaggag cccatgtgga aaaagttgtt aaaaagggag ccctgtgctg 9240 

ttcatagagt ataaaagaat tgaatgccct gcctcttgaa gaaagaggag tcagcttgct 9300 

ctctattgat gatgctttct ttccttattt aatgcctcgc cggctctgtt ttcctcagag 9360 

ctggggagat ggtgatggac aactgcatgg tcttagttgt tccacaagcc attgtcatcc 9420 

aaattcaact cagctccaac tcttacaatg tggtttcttt ggggagtctt ggctggactt 9480 

cacatagctc atgaatactt aaatattcca tctggtctgc aaacgtgtca tgggaagatt 9540 

gggtgttcbt agtgtgtgaa gtaattgtac tggttcatct ctcctcaggg tgaataagtg 9600 

gactttctgt ctgccttgta cttctagaaa gagacacaaa gaaatgtcag catgtgcttg 9660 

ttctctgttt tcccaggctg ttgcttctga gtctcaaggt cattacttag gagccagggc 9720 

ttttcagata ttgtcagctt tgattttagc tcctcctcca accttcccaa tttgtgtgtc 9780 

aagaaaatgt aacataagtc tacccagaag actctagata atctttgtaa tatgtccaag 9840 

agagtgtgtt tctaaaacat tattcatgca ggggcatctt caattcagag tcaaactcta 9900 

gcaaggcctg gtctgcaaga ttctgatgaa agattttctt gtccctctct ttcttctttc 9960 

cttcttccct ccctgcctcc ctttcttcct tcctttttaa aattttgtcc ctctctctfct 10020 

ctctatagtt aaaaggcttc ctcatctata ttaaatgctg gtgctgagtt aaagcttgat 10080 

ttttatttat ttaaagattt ttatgattac ttaggcttat aaaagtgtaa tctatctttt 10140 

taataagaca taacttgaat atgcataaat ttccaaacct cttttcttat tggaaattcc 10200 

tggatggtag tacagtacat tccaggattg tattctttta gcaggcctac taaaatgatc 10260 

attgggaatt tattttgaaa gtgagtaggg ttcaaccttg ttgaactaat cataaacttt 10320 

caggtcttat tcaagaatag taagtcatat tcctacaaaa gttatgcttc ttattcagac 10380 

tggccttaaa acaagctctt gggaatagcc cttaacatga aatttagcct aagggacctg 10440 

ctgctttctc actgtttctt tatcagggga aagaaaatac cagcctcctt ttgacaaaat 10500 

tgttggcttt gttcagtcag ggctgagatt ttgcaggtaa acctgggcgg tgtggcactt 10560 

cattgaattt attagttttg tattccaagg acattttatg aaaatatcaa atattctgaa 10620 

aacttccatg cttgatttag aatttaaata aacactatga ggcaactcct tggtagagat 10680 

tgtttaaatt tggaacacat actgactctt cccctgaatt tttgagatcc cctgagactg 10740 

gcattttact tctgtgtttt accagttgga atggagcaac cttcactaaa atctgtactg 10800 

atggattttc gtaacttttt aatgccattt aagaacataa cacctttaaa ttaaaaatct 10860 

tcaaaaggct cgtcattctg ttcaagagaa ggttcaatac atattcttct aattcagg 10918 



<210> 8 

<211> 1187 

<212> DNA 

<213> Homo sapiens 



<400> 8 

tttcgtgcta ccagaagaag tgtcgggtac gtctgcatta cacctggcgg gagctctggt 60 

cagccttgat aaatttgctg aagttcctta tgtcaaatga gactgtactt ttggccaaac 120 

acaacatttt tacattagcc cttatgattg tgaacctatt taatatgttt atcacatatg 180 

gcgacacatt tctgccaacc cccagcagct atgatgaact ttactatgag attatccgca 240 

tgcaccagag ctttgacaac ctctactcca tgggcctgag gctttctacc aatgcaggcc 300 

agtggaagga agcagctagc aaggtgaccc atgcattggt taatatcaga gccatcatca 360 

accactttaa ccccaaaatt gagtcctacg ctgctgtgaa tcacatatcc caactgtcag 420 

aggagcaggt gctggagggg gtgagagcca actatgacac gctcacgctg aagctgcagg 480 

atggcctgga ccagtatgag cgctactcag agcagcacaa ggaagctgcc ttcttcaaag 540 

agctggttcg atccattagc accaacgtcc ggagaaacct ggccttccac acactcagcc 600 

aagaagtcct gctcaaggag ttctccacta tctcctgagg ccacgcctac ctgagcagcc 660 

tctgactgcc cttacccatg aggatcatgg gctggagggg gagcgagggg agagggggct 720 

gcccccgagg ttggagagaa acacagatac caagttctct tggtgaagac cgcacttcaa 780 

tggagcttgg gcagcagggg caggagggtc aagccaggga taatcttatt tggggaggaa 840 

tgggtgggca cagtggggag aggcctccag gaatgtgggg acttccagag tgggctccct 900 

aaacagcccc tcacatttcc aatcttgagg ttacaagctg tagctactga ctaattttcg 960 

ggagctggtg aggcaagcct cagggtaatg ccagcaccct gaaagtcagt ggctctgctc 1020 

ggcttctgaa aacagatcaa ctctcagata tagctggagc tacatcctgc ttcctttggg 1080 

cctggggctg tgcttggctt gaaatctgtg tcccctgcct gcactggcac tttgtccagt 1140 

catcgggacc tttggctcag atttagggta agataggaat agggtac 1187 
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<210> 9 

<211> 1978 

<212> DNA 

<213> Homo sapiens 



<400> 9 

aagaaggagt tatcgttgtc cagcaacctg tgagtcttgt acattggtta gtgtagaagt 60 

taatcccagt gcctatagag cacctttctg aagagttttc tttttcttct ttttaacata 120 

atggacaatc aacgaagtat gtaagaacca ttactgtttg ctagcctcct ttaaagggaa 180 

ttctttattg agggctacaa taaatatgct tattcacctt gggctgcttt taagctgtag 240 

atttaatatt ataaacgatc tttaaggtgg gtaatttctc agttgttcac tgggtatgca 300 

tcaagagttt taactaaatt ttctattttc cttaaattct agaaagcgaa attattattg 360 

acgatggaca atttggaatc cacaatggtg ttgaaactct ggattctgga tggctgacat 420 

gtcagactga aataagatta cgfcttgcatt attctgaaaa acctcctgtg tcaataacca 480 

agaaaaaatt aaaaaaatct agatttaggg tgaagctgac actagaaggc ctggaggaag 540 

atgacgatga tagggtatct cccactgtac tccacaaaat gtccaatagc ttggagatat 600 

ccttaataag cgacaatgag ttcaagtgca ggcattcaca gccggagtgt ggttatggct 660 

tgcagcctga tcgttggaca gagtacagca tacagacgat ggaaccagat aacctggaac 720 

taatctttga ttttttcgaa gaagatctca gtgagcaccg taagttcagg ggtgatgccc 780 

ttcctgggac atgtgggtac agcttggtct ccttatccat cccaccattg ctgaagagtg 840 

ggaaagagtg ctggaattct tactcttccc atcatgagca gaaattcccg gaaaacaata 900 

ggcaaaggga gagttgacta tataattatt aagccattac caggatacag ttgtgacatg 960 
aaatcttcat tttccaagta ttggaagcca agaataccat tggatgttgg ccatcgaggg . 1020 

gcaggaaact ctacaacaac tgcccagctg gctaaagttc aagaaaatac tattgcttct 1080 

ttaagaaatg ctgctagtca tggcgcagcc tttgtagaat ttgacgtaca cctttcaaag 1140 

gactttgtgc ccgtggtata tcatgatctt acctgttgct tgactatgaa aaagaaattt 1200 

gatgctgatc cagttgaatt atttgaaatt ccagtaaaag aattaacatt tgaccaactc 1260 

cagttgttaa agctcactca tgtgactgca ctgaaatcta aggatcggaa agaatctgtg 1320 

gttcaggagg aaaattcctt ttcagaaaat cagccatttc cttctcttaa gatggtttta 1380 

gagtctttgc cagaagatgt agggtttaac attgaaataa aatggatctg ccagcaaagg 1440 

gatggaatgt gggatggtaa cttatcaaca tattttgaca tgaatctgtt tttggatata 1500 

attttaaaaa ctgttttaga aaattctggg aagaggagaa tagtgttttc ttcatttgat 1560 

gcagatattfc gcacaatggt tcggcaaaag cagaacaaat atccgatact atttttaact 1620 

caaggaaaat gtgtagattg atggccagtg catggacttc aggtgtcgga caacccccat 1680 

tgcaatgagc tttgcacgag tttgaaaatc tgctgcggat aaatgtacat actgaagact 1740 

tgctcagaaa cccatcctat attcaagagg caaaagctaa gggactagtc atattctgct 1800 

ggggtgatga taccaatgat cctgaaaaca gaaggaaaat tgaaaggaaa cttggagtta 1860 

atggtctaat ttatgatagg tatttgtttt tcatgaaaaa ttttcatgga atttagaagt 1920 

atagatttca tcttacgaaa ttataaataa aattaaactt agttaagtta aaaaaaaa 1978 



<210> 10 
<211> 241 
<212> DNA 

<213> Homo sapiens 



<400> 10 

gtgcccgtga cctttgatga tgtggccgtg tatttctccg agcaggagtg gggcaagctg 60 

gaggactggc agaaggagct ctacaagcac gtgatgaggg gcaactacga gacgctggtc 120 

tccctggact acgccatctc caagcccgag gtcctctccc agattgaaca agggaaggag 180 

ccctgcaact ggcgccgccc tggccccaag attccagatg ttcctgtgga ccccagtcca 240 

g 241 



<210> n 
<211> 4464 
<212> DNA 

<213> Homo sapiens 
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<400> 11 

gtgtgctccg gcacttggca gctggacagg cacagtgctg atgtgaaaaa taccacgaca 60 

aaaccaccct agttctccaa aatacttatc attcaggttg aataaaccag ttctacagac 120 

tgcttcttcc atgcatttcc caatgatgct aactgccctt aaaattcttt cagatttgga 180 

actgtatctg tatggaaaca acagagtttt cttgaagtac aactaaggac aggtcaccac 240 

taggcactaa catcgctgac ttgcatgatt atggagatgg tctatctgat gctgaaaatg 300 

tctctagttt tttgacaacg gctaaataac catgggatca agtggccttg ggaaagcagc 360 

aacattagat gaactgctgt gcacttgcat tgagatgttt gatgacaatg gagagctgga 420 

taatagttat ttgccaagaa tagttctact gatgcaccga tggtatttat cttccactga 480 

attggcagaa aaacttctct gcatgtatcg aaatgccact ggagaaagct gcaatgaatt 540 

tcgattaaag atctgctact tcatgaggta ctggattctg aagtttcctg cagagtttaa 6 00 

tttggatctt ggtttgattc gtatgactga ggaatttcgg gaagtagcta gtcaactagg 660 

atatgaaaaa cacgtcagcc tcatcgacat atccagcatt ccttcctatg actggatgag 720 

aagagtcaca cagaggaaaa aagtatccaa gaagggaaaa gcctgtctgc tgtttgacca 780 

tctggagccc attgaattgg ctgagcacct cacttttctg gagcataaat cttttagaag 840 

gatctcattc actgattacc aaagctatgt catccatggc tgcctggaga ataatccaac 900 

cttggaaaga tcgattgctt tatttaatgg aatctctaag tgggtccagt tgatggttct 960 

tagcaaacca accccccagc aaagggcaga agtcatcaca aagtttatca atgttgcaaa 1020 

gaagctcctt cagctcaaaa attttaacaa ccttattgca atfcgcaggga gccctcaagt 1080 

cataggtcca tttcaggctt caaagggacc cattcctcat ctttcctcag aagtttacaa 1140 

agaactggaa tgtgaaatga cagaagtggg tctcctccaa cgggcaatta ctgcaattac 1200 

cgcaagccct ttgccgactg cgatggcttc aaaatcccca tccttggagt acacttgaaa 1260 

gacttgatag ctgtccatgt cattttccca gactggacag aggagaacaa agtgaacatt 1320 

tgtgaaaatg caccagctct ccgttaccct gagtgaacta gtctccctgc agaatgcctc 1380 

tcaccacttt agaacccaac atggatttga tcaacctgct cacgctttcc ctggacctct 1440 

atcacactga agatgatatt tacaaactgt cactggtgct ggagcctaga aattctaaat 1500 

cgcagcctac ctcccctacg acgcccaaca agcctgtggt acccctggag tgggcattag 1560 

gggtgatgcc aaagccagac cccacggtca tcaacaagca cataaggaaa ttagtggagt 1620 

ctgtatttag aaactatgat cacgaccatg atgggtacat ttcccaagag gactttgaaa 1680 

gtatagctgc caatttttcc cttcttggat tccttctgtg ttctggacaa agatcaggat 1740 

ggcctaatta gtaaagatga aatgatggcc ttacttccat gagagctaaa tcccaactac 1800 

actgtcaaat cgcgcaccag gatttatcca caattttcag gagatgactc tatctcaagc 1860 

cagaccttct gcgaacactg tgcgggattt attctgtggc ataatcaagc aaggatacaa 192 0 

atgcaaagac tgtggagcca attgtcacaa acagtgcaaa gacctcctgg ttctggcctg 1980 

caggagaatt tgcccgggcg ccctccttga gcagtggtca tgggtcactg cctggaagcc 2040 

cctcgctgcc cccagcgcag ggattaggtg tttaagttcc ctggagtcac tgctgacaac 2100 

agcgatttta gacagcagag ccatcacact ggttacaggc tcttctcgca agaacttctg 2160 

tgaggctaca gagggccacc accagccagg ccaccccaga ctggaacctg tctggtcaga 2220 

ggctggctgg gggggactcg gggtcccaca cccttcccta taatagatat tcaggttcct 2280 

tgcacaagcc accaaaaggc acaagggctt tgcaatatgg gaaaaatgaa aaagcccagg 2340 

gtggcatggc tggtggggat gtttaaaacc cgggcaccgg agtttgaacc ttgccccgga 2400 

tgaaggagag aagaccacac acggatgggt gaggatgggc tgacttcaag gctggcggaa 2460 

aacttgaagg ccaataatgg ttggcttttt ggaagggggc aagaacaaga aactcttgag 2520 

gaaagctctt gcctctcagg aagttatctt ggaaagaacc ccctgatgtt aacttgcctt 2580 

ggggacactg tgggatctcc atgtttggac tatgggacag agaattgacc ctaactaaac 2640 

tatgaattat tatgtgcctc ctcccctacc cctagttaat gtgccacaaa gactgtgtta 2700 

tgtagtcagt acttttttct catgtatctt tctctagacc atttatatac gggctgaaat 2760 

gaaagccttt ttgcatgtac ttgatactca gtctgtaaac tcagacttcg ctttttttgt 2820 

gagactatcc tttcaatatt tttataaact tttgtgtgtg ctgtggcagg gaacagttaa 2880 

caaggagttt atttagaggg gtttttttca ttttttgttt ttcttgtttt gttttgtttt 2940 

gttttgttct tcttggtgtg ctttttggat gggaccagaa cttaacattt ctcctgacga 3000 

ctaggacgct catcagacac tgggaatgca gtgattcttc taatgatggt caactgcaag 3060 

gagaaactgt ttacatcctt cccactacac agttgctatg atatgtaaca attcacacat 3120 

gtatatatca cacaagtctt accaattctg cgtcactagg aagctatagc atggtcgtga 3180 

aagctctcct aatacttact tacttaaata gccatggaag aaaattatgc ttgtcccttc 3240 

tgactgggtt aagccatctt tcttaaagat tcctgaagta gaaatagcaa ttaagaaatt 3300 

aattcaagtt ttgacggtta acaaacattt agcagaaaaa caaaccattg aataagctac 3360 

taggtgaaat gcaaaacaac catcaatttt tttaaactca ataattttgt gaaaaaaaaa 3420 

accaccacca aacttttggt tgtaaaagag aacccttgtt cccttcctaa gaaactgcct 3480 

tccacaatta aagaatactc atagaaagaa aataatcata aatcatattt gattaacatt 3540 

tcatttaagc ttttaaaata tcaatttttt aaatagctcc catttaccca aaataactag 3600 

ttatgggtct atttcttcca tgattagagg agagagaact tgattcaaga tactgaaaaa 3660 

tagagctggg actgagcctg tgaatgacag gaatgatcta ttaccaggtg actagtcgga 3 720 
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aactacacag accgtgcact ggtctgcatt tgggggctaa agtgtatata ttccatttaa 378 0 

aatggaattt ggtttggaat tggggcagtc ttcagtaaaa gcctttctcc tttcttttct 3840 

ccattgagga ccagttctgc tcagcaaaaa ttgttgagta ccctggttct gggccaggtc 3900 

ccacgctata tgcaccaaag ttaggaaaaa cttgggtctt aagcccttgg ggagcttgac 3960 

cagcccatgg gcattaaggc aaagtagttc ccagtgattt aaaatacggg ttccaaatac 402 0 

gctaaaacca aacttgtgcc aacccagatt tacagattgg aaatactgca gatgatgtga 408 0 

agttataagt tggaggagct gtgatttagg ctggataata agagaacgtg ccatctgtaa 4140 

agcactcaga aggcagccat ccctagatgt tggtttcgat gtatattaca ctatctacta 4200 

ctatccataa atgcaataat atgcatgtta acaacattaa aaacagcaaa cagcaatcta 4260 

agtacagaaa agctttttgt gtgtttaaaa aaattgaaga aaattcagga agaaacgtgt 4320 

taataaacat tgtactgttc ttttgcttct caaaggaatt attcacttgc cactttggtt 4380 

atttttgagt tttcgtacat aggaagtttt atattgccag ccttcctgtg ataaagatat 4440 

taaatgtaac ttgagtaaaa aaaa 4464 



<210> 12 

<211> 1299 

<212> DNA 

<213> Homo sapiens 



<400> 12 

caattctgtc attatctgat acaaacttca gaataccctt cacctcagcc acagctgccc 60 

agtcccacag cttctttcct cccaacccca gcacccctgc acccttagct ccaaaattcc 120 

taaaaggttc actgtttgtt cttctaattt ctccccatga tgttgcgttt gaaattggtt 18 0 

tgctctagtg ggtttccctg tagccttgcg tattcctgag ctgcaacatc agattctggt 240 

ttttctccgc agtgactttc catttcctgt gaaaagcact gtacggactg gaagaatacg 3 00 

acactcagga tgggattgcc ataaacctcc tgcccgctgc tgttttctca ggactttgca 360 

aaaaaatggg ggtcccggtg acctcacagg aatcaggcca aaaattggtg tctccgctgt 420 

gcctggctcc gtatttccgg ttactgaaac tttgtgtgga gagacagcat aacggaaact 480 

tggaggagat tgatggtcta ctaaattgtc ctatattcct aactgacctg gagcctggag 540 

agaagttgga gtccatgtct gctaaagagc gttcattcat gtgttctctc atatttctta 600 

ctctcaactg gttccgagag attgtaaatg ccttctgcca ggaaacatca cctgagatga 660 

aggggaaggt gctcactcgg ttaaagcaca ttgtagaatt gcaaataatc ctggaaaagt 720 

acttggcagt caccccagac tatgtccctc ctcttggaaa ctttgatgtg gaaactttag 780 

atataacacc tcatactgtt actgctattt cagcaaaaat cagaaagaaa ggaaaaatag 840 

tgtatactct gaaagagcga tgcctccagg ttgtccggag cctagtcaag cctgagaatt 900 

acaggagact ggacatcgtc aggtcgctct acgaagatct ggaagaccac ccaaatgtgc 960 

agaaagacct ggagcggctg acacaggagc gcattgcaca tcaacggatg ggagattgaa 102 0 

gatttctgtt gaaacttaca ctgtttcatc tcagcttttg atggtactga tgagtcttga 1080 

tctagataca ggactggttc cttccttagt ttcaaagtgt ctcattctca gagtaaaata 1140 

ggcaccattg cttaaaagaa agttagctgt cttctgtagg cattgtgatg tttaggggca 1200 

aacatcacaa aatgtaattt aatgcctgcc cattagagaa gtatttatca ggagaaggtg 1260 

gtggcatttt tgcttcctag taagtcagga cagcttgta 12 99 



<210> 13 

<211> 1417 

<^212> DNA 

<213> Homo sapiens 



<400> 13 

atgcccgcgc cgcgcgcccg ggagcagccc cgcgtgcccg gggagcgcca gccgctgctg 60 

cctcgcggtg cgcggggccc tcgacggtgg cggcgggcgg cgggcgcggc cgtgctgctg 120 

gtggagatgc tggagcgcgc cgccttcttc ggcgtcaccg ccaacctcgt gctgtacctc 180 

aacagcacca acttcaactg gaccggcgag caggcgacgc gcgccgcgct ggtattcctg 240 

ggcgcctcct acctgctggc gcccgtgggc ggctggctgg ccgacgtgta cctgggccgc 300 

taccgcgcgg tcgcgctcag cctgctgctc tacctggccg cctcgggcct gctgcccgcc 360 

accgccttcc ccgacggccg cagctccttc tgcggagaga tgcccgcgtc gccgctggga 420 

cctgcctgcc cctcggccgg ctgcccgcgc tcctcgccca gcccctactg cgcgcccgtc 480 

ctctacgcgg gcctgctgct actcggcctg gccgccagct ccgtccggag caacctcacc 540 
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tccttcggtg ccgacgcagg tgatggatct cggccgcgac gccacccgcc gcttcttcaa 600 

ctggttttac tggagcatca acctgggtgc tgtgctgtcg ctgctggtgg tggcgtttat 660 

tcagcagaac atcagcttcc tgctgggcta cagcatccct gtgggctgtg tgggcctggc 720 

atttttcatc ttcctctttg ccacccccgt cttcatcacc aagcccccga tgggcagcca 780 

agtgtcctct atgcttaagc tcgctctcca aaactgctgc ccccagctgt ggcaacgaca 840 

ctcggccaga gaccgtcaat gtgcccgcgt gctggccgac gagaggtctc cccagccagg 900 

ggcttccccg caagaggaca tcgccaactt ccaggtgctg gtgaagatct tgcccgtcafc 960 

ggtgaccctg gtgccctact ggatggtcta cttccagatg cagtccacct atgtcctgca 1020 

gggtcttcac ctccacatcc caaacatttt cccagccaac ccggccaaca tctctgtggc 1080 

cctgagagcc cagggcagca gctacatgat cccgtgagcc tggctccttc tggccaatgt 1140 

tgcgtgggag ctgagtctgc gccctctgaa cgaccgcttg agcgaccctt tactgctgcg 1200 

gtgcaagctg cttccctctg ctctgcagaa gatggcgctg gggatgttct ttggttttac 1260 

ctccgtcatt gtggcagcca cagcccagct agcccaggcc aagggagtcc gacccccacc 1320 

ccgaagcaca gggaagttgc tgccctttcc cgcagtgaca ggaatcctct caaaggttct 1380 

ggaaggcaag gatgcggcca tccctggaga cgccaag 1417 



<210> 14 

<211> 620 

<212> DNA 

<213> Homo sapiens 



<400> 14 

ttttttttca gggctggaag ttgattttta atgataaagt acaatgaagg gagggcagag 60 

gggctaagcc tagctgtccc ggggtgctgt ggtggtggta gactggctac acaaactgtt 120 

gctgctgctg ctgcttcttg gtggccgcct tgctggcgag ggtccttggc cttctctgta 180 

gctgccagtg cggtctcctt tgccttctcc ttggcttcct tggctgtctc aacagggggg 240 

ttggaagggg cctcgccttg cagcttagcc aagatatatt caaaaccctt cataggtctt 3 00 

ggtcacgttg cttttgaacc gggcaagacc aaattcctgg gacagctctt ggagacacca 360 

aattaagcta ggaggagacc caggcttccc ggcggatttt cagtccagcc cactgttgtc 420 

aagagttcac aacagtaaaa cacatcgttt cctccaccac catcagccgg ggctgtggtt 480 

ggatgttcca ggtgaaggta gtcatggtct gattctgtgg gtccacaata gagtcctcca 540 

ggatgtacac tgaatgaaga acattggcag gaaacagttg cttggcccag tgggacatcc 600 

ctgtggtctt ggtcaggagt 620 



<210> 15 

<211> 2528 

<212> DNA 

<213> Homo sapiens 



<4O0> 15 

tttcgtcgct gaccagcagc atggaggacc ccgcegctcc tgggaccggg ggcccgcccg 60 

caaatggcaa tggcaacgtg tggcggcaaa gggaagcagg cggcgcccaa gggccgcgaa 120 

gcgttccgaa gccagcggcg ggagtcagag ggctctgtgg actgtcccac tctggagttt 180 

gagtatggag atgcagatgg gcatgcagcc gagttgtcag aattgtatag ttacactgag 240 

aacctggaat tcaccaataa caggaggtgc tttgaagaag atttcaagac tcaagtgcag 3 00 

ggcaaggaat ggctggagtt ggaagaagat gcccaaaagg cctatataat gggactcttg 360 

gaccggctag aggtggtcag tagggaacgg cggctgaagg cggcccgggc tgttctctac 420 

ctggcccaag gtacttttgg ggaatgtgat tcagaggtcg atgtgctaca ctggtccagg 480 

tacaactgct tcctgctgta tcagatgggg accttctcca ccttcctgga gctactccac 540 

atggaaattg acaacagcca ggcctgtagc agtgcccttc ggaaaccagc tgtctccata 600 

gctgatagca cagagctcag ggtgctgctg agtgttatgt acctaatggt ggaaaatatt 660 

cgcctggagc gagagacaga cccctgtggg tggagaacag cccgggagac cttccgcact 720 

gaattaagct tctccatgca taatgaggag ccttttgccc ttttactctt ctccatggtt 780 

accaagttct gcagtggcct ggctcctcac ttccccataa agaaggtcct gctcctgctc 840 

tggaaggtgg tcatgtttac cctcggtgga tttgagcatc tgcagactct caaagtacag 900 

aagcgggcag aattgggcct gcctccactg gctgaagaca gtatccaggt ggtgaagagc 960 

atgcgtgctg cctccccgcc ctcttacact cttgacctgg gagagtctca gctggcaccc 1020 

ccaccctcca agctgcgagg ccgccgtggc tctcgaaggc aactcctcac taagcaggac 1080 
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agcctggaca tctacaatga aagggatctc ttcaagactg aggagcccgc cacagaggag 1140 

gaagaggagt ctgctggtga tggagaacga accttggatg gagagctaga cctgctagag 1200 

caggaccctc tggtgccacc tccaccctca caggcacccc tctctgctga gcgggtggct 1260 

tttcccaagg gcctgccctg ggccccaaag gtcagacaga aggggacatt gagcacttct 1320 

tggagatgag caggaacaag ttcatcggat tcaccctggg tcaggacaca gatacattgg 1380 

ttggattacc caggcccatc catgagagtg tgaagaccct aaagcagcac aagtatatct 1440 

ccatcgcaga tgtgcagatc aagaatgaag aggagctgga gaagtgccct atgtctttgg 1500 

gggaagaggt ggtaccagag acgccatgtg aaatcctcta ccagggaatg ctgtacagcc 1560 

ttccgcagta tatgatcgct ctgcttaaga ttctgctggc tgcagctccc acctctaagg 1620 

ctaagacaga ctctatcaat atcctggcag atgtcctacc tgaggagatg cccatcactg 1680 

ttctccagag catgaagctg ggcatcgatg tgaacaggca caaggagatt attgtaaaga 1740 

gtatctctac cctgcttctg ctactcctca aacacttcaa actcaaccat atctaccagt 1800 

ttgaatatgt atcgcaacat ttggtatttg ccaactgcat ccccttgatc ctgaagttct 1860 

tcaatcaaaa tatcttgtca tacatcactg ccaaaaacag catctcagtc ctggattatc 1920 

cttgctgtac catccaggat ttgccggagc ttactactga aagtctggaa gctggagaca 1980 

acagccagtt ctgctggagg aacctctttt cctgcatcaa cctcctgagg ctgctcaata 204 0 

aactgaccaa atggaaacat tcccggacca tgatgctggt agtgtttaaa tcggcaccaa 2100 

tcttaaagcg ggccctcaag gtcaaacagg ccatgctgca actttatgtc ctaaagctac 2160 

taaagttaca gaccaagtac ctggggcgcc aatggaggaa aagcaacatg aaaaccatgt 2220 

cagccattta ccagaaagtg cgtcaccgca tgaacgatga ctgggcttac gggaatgaca 2280 

tcgatgccag accatgggac ttccaagcag aagaatgtac cttgagggcc aacattgagg 2340 

cttttaacag ccgtcgctat gacagacccc aggactctga gttttcacct gtggataact 2400 

gcttgcagag cgtactgggg cagaggttgg atctgcctga agatttccac tattcatatg 2460 

agctctggct cgagagagag gtgttttcac agcccatctg ttgggaggag ctgctccaga 2520 

atcactga 2528 



<210> 16 
<211> 2447 
<212> DNA 

<213> Homo sapiens 



<400> 16 

gagcgggagg cgagtggcga gtggcgagtg gcgagtgtca ggggggcggc cggcggtggc 60 

ggggcggccg gaggaggcgt tggcagcggg ctcggaccca cgcggcgccg cggcccgcct 120 

ggcctgcagc gctcccaccc ccggcggcgg cacgatgccc tttgacttca ggaggtttga 180 

catctacagg aaggtgccca aggaccttac gcagccaacg tacaccgggg ccattatctc 240 

catctgctgc tgcctcttca tcctcttcct cttcctctcg gagctcaccg gatttataac 300 

gacagaagtt gtgaacgagc tctatgtcga tgacccagac aaggacagcg gtggcaagat 360 

cgacgtcagt ctgaacatca gtttacccaa tctgcactgc gagttggttg ggcttgacat 420 

tcaggatgag atgggcaggc acgaagtggg ccacatcgac aactccatga agatcccgct 480 

gaacaatggg gcaggctgcc gcttcgaggg gcagttcagc atcaacaagg tccccggcaa 540 

cttccacgtg tccacacaca gtgccacagc ccagccacag aacccagaca tgacgcatgt 600 

catccacaag ctctcctttg gggacacgct acaggtccag aacatccacg gagctttcaa 660 

tgctctcggg ggagcagaca gactcacctc caaccccctg gcctcccacg actacatcct 720 

gaagattgtg cccacggttt atgaggacaa gagtggcaag cagcggtact cctaccagta 780 

cacggtggcc aacaaggaat acgtcgccta cagccacacg ggccgcatca tccctgcaat 84 0 

ctggttccgc tacgacctca gccccatcac ggtcaagtac acagagagac ggcagccgct 900 

gtacagattc atcaccacga tctgtgccat cattggcggg accttcaccg tcgccggcat 96 0 

cctggactca tgcatcttca cagcctctga ggcctggaag aagatccagc tgggcaagat 1020 

gcattgacgc cacacccagc ctaatggccg aggaccctgg gcatcgccag ccttgcctcc 108 0 

agtgccctgt ctcctttggc cctcaatctg gtcccaaatc tggctgtgtc ccaaagggtg 1140 

tgtgggaagt ggggggaaag tagaggatgg ctcgatgttt tgcagctacc tcttttcccc 1200 

gtgtttcttt ttagacaaat tacactgcct gaagttgcag ttcccctttc cctggggagc 1260 

cccaagaaca gagtcaggca aggggtgggg agtccaggga tcttggggac ccctcctagg 1320 

agagctgcag tctcttccct caggggaaca tcccagaatg catatcgatc agctctcagc 1380 

caggcttcga caatctcgca gcccccacta ggtggacaca ttaatgattt ggtttctccc 1440 

ctgggcagcc aacctgcccc agaggcacca gacctgggct ttcagctttg ggaccaggct 1500 

gcccaaaggt actcctttat acacccggca ccttccacga aagatggtac ttcccaagca 1560 

agcccctatg atttgtcact atagatggaa ccctgacttc tgccccatcc cttcctgccc 1620 

aacctagaac ccaggcctca agtctttacc ccaccccttt cttgttcttc caagaagcag 1680 

atgcccagtt gctcagcagc agcggtagag acttgaatct gcccaccagt cacaaggcgg 1740 
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gtcacagatt cctcttcctc tcttctcctc gttcctctga accctccacc aatgtgcctc 1800 

agcctgtgtg ctgtgtggca acagcattct ggttcccact gccaagatct cccaccactc 1860 

tgctgggatc tgcagtggca gggagtgggg gttgtgtaaa ggggaagtca tcttttgaga 1920 

tccagataga catggtttgt gcacttacgt ccagatggga agcatccttc ctgcaaccct 1980 

aaaataatca tgcagcctct cagacggacg ccatcggtcc caaggcctta ggtggaggaa 2040 

gcaaagcagg ccaggcctgt cctgtccgtg gacctctacc ttctggactc cctacgggtg 2100 

cagagcactt gggtttctct acagccatcg tggcccactt gacactgtgc tcctccatca 2160 

gctggtcaca tgccaacacg ttcccagccc ctgaggcagc tccagggtgc cccacctgct 2220 

cctgaggtgg gtccctaccc tgctgctcct cttcatcctt tcccttttgt cctgaaaggg 2280 

aggagcaatg gtccaggcat taattccacc cagggaattt tagctatgcc ctcatgtccc 2340 

agggagagag ccacacgcct gttttccatt tatagcaaga ttgtttgcat acttttgtaa 2400 

tgaaggggag tgtccagtgg aaggattttt aaaattatct tatggat 2447 



<210> 17 
<211> 2657 
<212> DNA 

<213> Homo sapiens 



<400> 17 

ctgggctcgg ccacaactgt gttccacacc agagtgggtg ctagggatag ggagaggcca 60 

gggagtggga gcaggcactt ctgagcttgt gggggaagga ggcttcctga gccctttaca 120 

ttgcatcaca taggaatcca cataatacct ggttgacgca acatcagtcc aactgaccac 180 

aaagcaacta ggggtttcag agggagtaca cgtaaggggc aaaagcagca acggaatacg 240 

acaacggaat aagcatgaat acgacctggc tacctgaagg aggtacaaga aaattcccca 3 00 

ggacacagag ctggtttgga gcctttcttt gattttattg tttctattaa tggttcaaga 360 

ttaaataaag acaatgacac tcttaaggat ctgctgaaag caaacgttga aaagcctgta 420 

aagatgctta tctatagcag caaaacattg gaactgcgag agacctcagt cacaccaagt 480 

aacctgtggg gcggccaggg cttattggga gtgagcattc gtttctgcag ctttgatggg 540 

gcaaatgaaa atgtttggca tgtgctggag gtggaaatca aattctcctg cagcactggc 600 

aggtcttaga ccacacacgt gattatataa ttggagcaga tacagtcatt gaatgagtct 660 

gaagatctat tcagccttat cgaaacacat gaagcagaac acattgaaac tgtatgtgta 72 0 

ccacacagac acgtgagtaa cggtcgagaa gtgattatta caccaaattc tgcatggggt .780 

ggagaaggca gcctaggatg tggcattgga tatggttatt tgcatcgaat acctacacgc 840 

ccatttgagg aaggaaagaa aatttctctt ccaggacaaa tggctggtac acctattaca ,. 900 

cctcttaaag atgggtttac agaggtccag ctgtcctcag ttaatccccc gtctttgtca 96 0 

ccaccaggaa ctacaggaat tgaacagagt ctgactggac tttctattag ctcaactccc 1020 

accagctgtc agtagtgttc tcagtacagg tgtaccaaca gtaccagtta ttgccaccac 1080 

aagtaaacca gtccctcact tctgtgccac caatgaatcc agctactaca ttaccaggtc 114 0 

tgatgccttt accagcagga ctgcccaacc tccccaacct caacctcaac ctcccagcac 1200 

cacacatcat gccagcgggt tggcttacca gaacttgtaa acccaggttt tgccacctct 1260 

ttcttccatg cctccccgaa acttacctgg gcattgcacc tctccccctg ccatccgagt 1320 

tcctcccgtc attccccttg gttccagaga gctcttctgc agcaagctca ggagagctgc 1380 

tgtcttccct cccgcccacc agcaacgcac cctccgaccc tgccacaact actgcaaagg 1440 

cagacgctgc ctcctcactc actgtggatg tgacgccccc cacctgccaa gggccccccc 1500 

accgttgagg acagatgtcg gcgactccac cccattcaga gagaaacctg tttctgcggc 1560 

tgaggatgcc aatgcttctg agtcaccctt aactttgaac cattctttgg aaatgggggt 1620 

ggtatattta accacggggg ccgtttctgg aaacgcaaac tatcattaat ttcatactag 1680 

tttgtaccgt atctgtcagg catcctgtaa ataattccca aggggaaaac taaacgagga 1740 

cgtgggttgt atcctgccca ggttgagtgg ggctcacacg ctagggtgag attgtcagaa 1800 

agcgcttgta ttttaaacaa ccaaaaagaa ttgtaagggt ggcttgctgc ccaggctttg 1860 

cactgcccgt tccctggggg tgtgcatctt cgggaaaggt ggtggcgggg cgtccactag 1920 

gtttcctgtc cccgtgctgc tcccttccgg taaggaaaat gaaatattct atgcctaata 1980 

ctccacaggc aacatttctt ggtactttgg taagtcgttt tgcggagaat gcagaccacc 2040 

tcacttaaac ttgtaaaccg gtaaaagaga tttttacttt tgggtctccg gtgagtcgca 2100 

tcttcttact tagggtttac acaggaaatt ccacctggaa gacttgtgtt aaaagttcta 2160 

cagcggcgca ctggttaacc tgaaccgtct tttttcttca gcctatacgc ggatcccttg 2220 

ttttggagct ctcagaatca actcagacaa acattttgta aacggctgct ggttgctttc 2280 

tacatacacc ttataaagtg acatttcaaa aagaaatcaa ggtgccacag ttttaaacca 2340 

gaaggtggca ctttttcttc ttttgggggt gggggcctcc cttgtagtat tatagctata 2400 

ctggggaaag ccataggtac aggcaattaa agtaccagta attttacttt ttttcttgtg 2460 

ttacatctaa attacaaccc ttaattgcca cgtgtgcact tactactctc cagtatgtct 2520 
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tattactctc cagtatgtca cgcatcttta acttttcacg tcctatgttt gctttctccc 2580 
atttttaaga gatggtaagt taactggaat tgatttactg aatgaaatta aatgcagata . 2640 
tccctgtttt tgaaata 2657 



<210> 18 

<211> 298 

<212> DNA 

<213> Homo sapiens 



<400> 18 

gaatcctcac tatgggaaat atttgtatct taaaaggaaa ttaaatgtag ttaacccata 60 

cagcccagaa attattgtaa aatttgaaaa tttccaggtt gtgtcctcta tccatatcct 120 

gtgcattcat aagctaaaac tttcatcaaa ccaagaagat gccgtgattg tggagcaacc 180 

agaagtgatt ccattaacag aggaccaaga agaaaaagaa ggtgaaaaag cagaaattgt 240 

tggctctgtt tatggcattt caagaaatgc aaatggtaaa gaggaaaatc aaacagct 298 



<210> 19 

<211> 3892 

<212> DNA 

<213> Homo sapiens- 



<400> 19 

cctcccatct cccttccccg aaggcagcaa catccccagc ttctacacag tcgcaccata 60 

ggcaagcgtc cttggacctc gccactgtcc ccagagcaac tcttggagaa caacagtgca 120 

gataaagcct ctggttctgt gtccaccacg ttgccggttg cttataaaaa gggagtcaaa 180 

catctttctg ctcttcttga tccagaggtc ctgtccattt tcgtgcctcc ttttatcagt 240 

aaagaggaca gtcaaacggc cggtgccaac tgcggcactc tcggtaaaac ccggatgcgc 3 00 

tccttgagaa agaagagaga gaagcccaga ccagagcagt gggagggcct cccggggccc 360 

cccagagcgc cagagcctga ggatgtcgcc gtcccgggcg gcgtggacct cctcaccctg 420 

ccgcagctgt gcttcccagg gggtgtgtgc gtggccactg aacctaagga ggattgcgtc 480 

cacttcctgg tgctgaccga tgtctgcggg aataggacct atggcgtggt ggcccagtac 540 

taccggcccc tgcatgatga gtactgtttc tacaatggca aaaccgcacc gggagtgtcc 600 

tggctgcttc gtgcccttcg cggtgtgcgt ggtctccagg tttccctatt acaactccct 660 

caaggactgc ctttcctgtt tattggctct tctgaagccc tgtaaagatt ttgaagtgga 720 

caatcatata aaagatttcg ctgcgaagct gtctttaata cccagcccgc cacctggacc 780 

gctccatttg gtatttaaca tgaagtcgct ccagattgtg ttacctgccc gagcagaccc 840 

cgaaagcccc atcctggacc tggaccttca cctgcccttg ctgtgcttcc ggcctgagaa 900 

ggtgctacag atcctgacat gcatcctgac ggaacagcgg atcgtcttct tctcctcgga 960 

ctgggctctg ctgacgctgg tcactgagtg cttcatggcc tacctgtatc cgctgcagtg 1020 

gcagcacccc ttcgtgccca tcctgtcgga ccagatgctg gattttgtca tgggccccac 1080 

atccttcctg atgggctgcc atctcgacca cttcgaagaa gtcagcaagg aagccgacgg 1140 

tttagttctg ataaatattg atcatgggag catcacctac tccaagtcca cggacgataa 1200 

cgtggacatt cctgatgtcc ccctcctggc agcccagacg tttattcaga gggtgcagag 1260 

cctccagctc caeca tgagc tgcacgccgc ccacctcctc tccagcacag acctgaagga 1320 

gggccgagcc caccggcggt cctggcagca gaaactcaac tgecagatae agcagaccac 13 80 

cctgcagctg ctegtgagea tcttcaggga tgtaaagaat catttaaact atgaacacag 1440 

agtctttaat agtgaagaat ttctcaaaac cagggctcca ggggaccatc agttttataa 1500 

gcaggtctta gacacctaca tgttccattc ttttcttaaa gcccggctca ataggaggat 1560 

ggacgccttt gctcagatgg acctcgacac ecagteggag gaggacagaa taaatggaat 1620 

gcttctaagt ccaaggagac cgaccgttga gaaaagagee teceggaagt cctcgcaccc 1680 

gcatgtcacc cacaggcgca tggtggtcag catgcccaac ctgeaggaca ttgccatgcc 1740 

tgagctggca cccaggaact cctcgctccg getgaeggae accgcaggct gtaggggcag 1800 

cagegcagtt ctgaatgtca cgccgaagtc cccgtataca ttcaagattc ccgaaatcca 1860 

ctttccgctg gagagcaagt gcgtgcaggc ataccatgcc cactttgtct ecatgetgag 1920 

egaggecatg tgctttctgg cccccgataa ctctctgctc ctggcccgct atttgtacct 1980 

ccgagggctc gtttatctga tgcagggaca getgetgaac gccctcttgg acttccagaa 2040 

tctgtataaa acagacatac ggatctttcc cactgatttg gtgaagagga cggtggaatc 2100 

catgtctgcc cctgagtggg agggggctga gcaggcgccg gagctgatga ggctcatcag 2160 
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cgagatcctg gacaagccgc acgaggcctc gaagctggac gaccacgtga agaagttcaa 2220 

gctgcccaag aagcacatgc agctgggcga cttcatgaag cgggtccagg agtcagggat 2280 

cgtgaaggac gccagcatca tacaccggct gttcgaggcc ttgactgtag gacaggagaa 2340 

acaaatcgac ccagaaacat tcaaagattt ctacaactgc tggaaggaga cggaagcaga 2400 

agcccaggag gtcagtctgc cgtggctggt gatggaacac ctggataaaa acgagtgtgt 2460 

gtgtaagttg tccagctccg tcaagacaaa cctaggcgtt ggcaagatcg ccatgaccca 2520 

gaagcgcctg ttcctcctaa ccgaaggaag gccaggctac ttggagattt ccaccttcag 2580 

aaatatagag gaggtcagga gaaccactac tacatttcta cttcggagaa tacccacttt 2640 

aaaaatcaga gtggcgtcca agaaagaagt cttcgaagcc aacctgaaaa ccgagtgtga 2700 

cctttggcac ctgatggtga aggagatgtg ggctgggaag aagctggccg atgaccacaa 2760 

ggaccctcac tacgtccagc aggcgctgac caacgtcttg ctgatggacg ccgtcgtggg 282 0 

cacactgcag tcaccaggcg ccatctacgc tgcctccaag ttatcctact ttgataagat 2880 

gagtaacgaa atgcccatga cgcttccgga gacaaccctg gaaacactga agcataaaat 2940 

caacccctcg gcgggggagg cgttcccaca agcggtggac gtgctgctct acactccagg 3000 

gcatcttgac ccagccgaaa aagttgaaga tgctcacccc aagttatggt gtgctctgag 3060 

cgaaggcaag gtgaccgtgt tcaatgcttc ttcatggacc atccaccagc actcctttaa 312 0 

agtgggcact gcaaaagtga actgcatggt gatggccgac cagaaccagg tgtgggttgg 3180 

ctcggaagac tccgtcatct acatcatcaa cgtccacagc atgtcctgca acaagcagct 324 0 

cacagcccac tgctccagtg tcacggattt gattgtgcag gacggacagg aggcacccag 3300 

caacgtgtac tcgtgcagca tggacggcat ggtgctggtg tggaatgtga gcacactgca 3360 

ggtgaccagc cgcttccagc tgccgcgagg tggcctgacg tccatcagac tgcacggcgg 3420 

ccgcctgtgg tgctgcacag gtaacagcat catggtcatg aaaatgaatg gatccctcca 3480 

tcaagaattg aagattgagg agaacttcaa agacaccagt acctccttcc tggccttcca 3540 

gctccttcct gaggaggagc agctgtgggc ggcctgtgca ggacgcagcg aggtttacat 3600 

ctggagcctg aaggacctgg cccagccccc gcagagggtg cccctcgagg actgctctga 3660 

gatcaactgc atgatccggg tgaagaagca ggtctgggtg ggcagccgag ggctggggca 3720 

gggaacaccc aaggggaaaa tctacgtgat tgacgccgag aggaagaccg tggagaagga 378 0 

gctggtggcg cacatggaca ccgtgaggac gctgtgctcg gctgaggaca gatacgtgct 3840 

gagtgggtcg ggcagggagg aggggaaagt cgccatttgg aaaggcgaat aa 3892 



<210> 20 

<211> 1205 

<212> DKTA 

<213> Homo sapiens 



<400> 20 

atgcagccta cacctggttg actggacgac cactttagtg gaaaggagac agtctgggcc 60 

tctggcctgc catgtgcaca agcataacaa ttatttttgt ttaacatgca gatggaatat 120 

ttgatccctt ttaaccaggc atttgcacct tggtatcttg tcttaattgc caaagtttgt 180 

tttaagtctt taacttctat gatcctctag taaaaatgaa aggtatgggt ttaacgaaaa 240 

ttgcaaaaaa cctggtgggg ccagtccatc cttgctcttt actggtccac agaacgttgg 300 

accaactatg gcataaaaag ctctacattg gtgggacaag actcctggtt gacactgggg 360 

tctttatcaa aatctccccg gattaaatgg tcccaattct ctaatgccca gtctgacgag 420 

agtcaggagg gacagaggta cttttctgaa gtagagagct gtctttgcct tggcaagtcc 480 

ccacagggta taacaaggca agcattaaat gcaaaagttt gaagcaaaat tgacttggtt 540 

atgttaataa ctagatggtc agcaatagag tgaggaaaga agaaagagta ttacaataga 600 

agaaagagag agagttacat ttttcttagc tttagcttgg tagggttttc ccctgggact 660 

atggcccacg actctggagg gggtggcact ttcttgactt gggtgtgatg agtccatctt 720 

tttttttttt tttttttttt tttgctgtac gaacagcagt cttggtggtt agcagcacag 780 

ggtagggtcc ttcccaggct ggctcgagtt ttctttcttt ccaccctttg atgagaatgt 840 

gatcctcaag ctggtgctgg tttactggaa attcagttac tttgtgccat acctttgaaa 900 

ctagggactt gtctagactt ccttctgatg gccatcccac ctctaatgcc ggccaatcta 960 

tctcacacaa agccttaagc tttccaggtg tcatagcaat tccatagtcc ccattggatc 1020 

ctttcttgaa attctttaac acagttccta gtggggtggg cttactttgt gcctgaccca 1080 

tgcttcctca agacaaaaca ccacactcac accatacaca caccacaaaa caaagaacag 1140 

gtaaaaaggg cacacacaca cttttacagt ttataccaaa ccagaatcaa aaccaaagtc 1200 

agaat 1205 



<210> 21 
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<211> 4289 

<212> DNA 

<213> Homo sapiens 



<400> 21 

agagccaccg cggagcgcgc gcggggttgg ttgccgcgag cgtgggggag cgtggaccgc .60 

ggcgctgctc agcggtgggg ctgccttccc ccggccctcc tccctggtcc ctggcgaggg 120 

cactggcggc ggcggggccg gggtccgcaa ggccggagaa ggccgccggg cccgggcatg 180 

gtggtctggg gcaacgcgga agaagctcca ccatgaggcg aggtggatgg aggaagcgag 240 

ctgaaaatga tggctgggaa acatggggtg ggtatatggc tgccaaggtc cagaaattgg 3 00 

aggaacagtt tcgatcagat gctgctatgc agaaggatgg gacttcatct acaattttta 360 

gtggagttgc catctatgtt aatggataca cagatccttc cgctgaggaa ttgagaaaac 420 

taatgatgtt gcatggaggt caataccatg tatattattc cagatctaaa acaacacata 4 80 

ttattgccac aaatcttccc aatgccaaaa ttaaagaatt aaagggggaa aaagtaattc 540 

gaccagaatg gattgtggaa agcatcaaag ctggacgact cctctcctac attccatatc 600 

agctgtacac caagcagtcc agtgtgcaga aaggtctcag ctttaatcct gtatgcagac 660 

ctgaggatcc tctgccaggt ccaagcaata tagccaaaca gctcaacaac agggtaaatc 720 

acatcgttaa gaagattgaa acggaaaatg aagtcaaagt caatggcatg aacagttgga 780 

atgaagaaga tgaaaataat gattttagtt ttgtggatct ggagcagacc tctccgggaa 840 

ggaaacagaa tggaattccg catcccagag ggagcactgc catttttaat ggacacactc 900 

ctagctctaa tggtgcctta aagacacagg attgcttggt gcccatggtc aacagtgttg 960 

ccagcaggct ttctccagcc ttttcccagg aggaggataa ggctgagaag agcagcactg 1020 

atttcagaga ctgcactctg cagcagttgc agcaaagcac cagaaacaca gatgctttgc 1080 

ggaatccaca cagaactaat tctttctcat tatcaccttt gcacagtaac actaaaatca 114 0 

atggtgctca ccactccact gttcaggggc cttcaagcac aaaaagcact tcttcagtat 12 00 

ctacgtttag caaggcagca ccttcagtgc catccaaacc ttcagactgc aattttattt 1260 

caaacttcta ttctcattca agactgcatc acatatcaat gtggaagtgt gaattgactg 1320 

agtttgtcaa taccctacaa agacaaagta atggtatctt tccaggaagg gaaaagttaa 1380 

aaaaaatgaa aacaggcagg tctgcacttg ttgtaactga cacaggagat atgtcagtat 144 0 

tgaattctcc cagacatcag agctgtataa tgcatgttga tatggattgc ttctttgtat 1500 

cagtgggtat acgaaataga ccagatctca aaggaaaacc agtggctgtt acaagtaaca 1560 

gaggcacagg aagggcacct ttacgtcctg gcgctaaccc ccagctggag tggcagtatt 1620 

accagaataa aatcctgaaa ggcaaagcag cagatatacc agattcatca ttgtgggaga 16 80 

atccagattc tgcgcaagca aatggaattg attctgtttt gtcaagggct gaaattgcat 1740 

cttgtagtta tgaggccagg caacttggca ttaagaacgg aatgtttttt gggcatgcta 1800 

aacaactatg tcctaatctt caagctgttc catacgattt tcatgcatat aaggaagtcg 1860 

cacaaacatt gtatgaaaca ttggcaagcc tacactcata acattgaagc tgtcagttgt 1920 

gatgaagcgc tggtagacat taccgaaatc cttgcagaga ccaaacttac tcctgatgaa 198 0 

tttgcaaatg ctgttagaat gagaaatcaa agaccagacg aaatgtgctg cctctgatgg 2040 

aattggttct aatattctcc tggctagaat ggcaactaga aaagcaaaac cagatgggca 2100 

gtaccaccta aaaccagaag aagtagatga ttttatcaga ggccagctag tgaccaatct 2160 

accaggagtt ggacattcaa tggaatctaa gttggcatct ttggcaatta aaacttgtgg 2220 

agacttgcag tatatgacca tggcaaaact ccaaaaagaa tttggtccca aaacaggtca 228 0 

gatgctttat aggttctgcc gtggcttgga tgatagacca gttcgaactg ataaggacag 2340 

aaaatctgtt tcagctgaga tcaactatgg gaataaggtt tactcagcca aaagaggcag 2400 

aagcttttct tctgagtctt tcagaagaaa ttcaaagaag actagaagcc actggcatga 2460 

agggtaaacg tctaactctc aaaatcatgg tacgaaagcc tggggctcct gtagaaactg 2520 

caaaatttgg aggccatgga atttgtgata acattgccag gactgtaact cttgaccagg 2580 

caacagataa tgcaaaaata attggaaagg cgatgctaaa catgtttcat acaatgaaac 2640 

taaatatatc agatatgaga ggggttggga ttcacgtgaa tcacctggct tccatctaat 2700 

ctgaaccctt ccacatgtcc cagtcgccca tcagttcagt caagccactt tcctagtggg 2760 

tcatactctg tccgtgatgt cttccaagtt cagaaagctt aagaaatcca ccgaagagga 2820 

gcacaaagaa gtatttcggg ctgctgtgga tctggaaata tcatctgctt ctagaacttg 288 0 

cactttcttg ccaccttttc ctgcacatct gccgaccagt cctgatacta acaaggctga 2940 

gtcttcaggg aaatggaatg gtctacatac tcctgtcagt gtgcagttcg agacttaacc 3000 

tgagtataga ggtcccgtca ccttcccagc tggatcagtc tgttttagaa gcacttccac 3060 

ctgatctccg ggaacaagta gagcaagtct gtgctgtcca gcaagcagag tcacatggcg 312 0 

acaaaaaaga aagaaccagt aaatggctgt aatacaggaa ttttgccaca accagttggg 3180 

acgagtcgtt gttgcaaata ccgagaacct caagaatcga acaagtgacg caggaataaa 3240 

tttaatagcc cttccagcat tttcgacagg tggaccctga ggtatttgct gccctttctt 3300 

gctgaacttc agagggagct gaaagcagcg tatgatcaaa gacaaaggca gggcgagaac 3360 

agcactcacc agcagtcgag ccagcgcatc tgtgccaaag aatcctttaa tatcatctaa 3420 

aggcagcgag tgaaagaaaa gaaaagaaac aagaagaaaa aaaccattgg ttcaccaaaa 3480 
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aggattcaga gtcctttgaa taacaagctg cttaacagtc ctgcaaaaac tctgccaggg 3540 

gcctgtggca gtccccagaa gttaattgat gggtttctaa aacatgaagg acctcctgca 3600 

gagaaacccc tggaagaact ctctgcttct acttcaggtg tgccaggcct ttctagtttg 3660 

cagtctgacc cagctggctg tgtgagacct ccaccacccc aatctagctg gagctgttga 3720 

attcaatgat gtgaagacct tgctcagagg aatgggtaac tacaatttca gatccaatgg 3780 

aagaagacat tctccaacgt tgtgaaatac tgtactgatc taatagaaga caaagatttg 3840 

gaaaaactgg atctagttat aaaatacatg aaaaggctga tgcagcaatc ggtggaatcg 3900 

gtttggaata tggcatttga ctttattctt gacaatgtcc aggtggtttt acaacaaact 3960 

tatggaagca cattaaaagt tacataaata ttaccagaga gcctgatgct ctctgatagc 4020 

tgtgccataa gtgcttgtga ggtatttgca aagtgcatga tagtaatgct cggagttttt 4080 

ataattttaa atttctttta aagcaagtgt tttgtacatt tcttttcaaa aagtgccaaa 4140 

tttgtcagta ttgcatgtaa ataattgtgt taattatttt actgtagcat agattctatt 4200 

tacaaaatgt ttgtttataa agttttatgg atttttacag tgaagtgttt acagttgttt 4260 

aataaagaac tgtatgtata ttttgaaaa 4289 



<210> 22 

<211> 686 

<212> DNA 

<213> Homo sapiens 



<400> 22 

ctctcattca agtaaattct actcattagt caatgatttc tcctcacatc gattttgtgg 60 

gagtggatct gattagatca ggtaaagaaa ctcgagtgtg gtgaatcact gccctcccaa 120 

ggtcccttac acatttttta agacagagca gaggagcaag tccacgtgga ggtttaaatt 180 

caagccagct gtgctctcaa agtggtgaaa aggctgggat ttattttaaa agccacccca 240 

aaaaacacat ttccatctga tgtcatttac atttcttccc attttattta ttttaaaaaa 300 

ttcatgtccc taaagtaaga tgtagatgtg ccatatgtag tcatttaatt ccacgtagca 360 

ttcattttaa gtacctgttc cggctttata catgcttggc agttgactgc agggatggga 42 0 

actgtgtatc actgtagatt tctggtggtc cttgtggtgt tttccttgtt gtggttaacc 480 

tggccccagg aggccfcatac acaccactag tcatcgctgg ttctggggtt tctgtaacaa '540 

ttactggagc aggaggtgct tgtactggag ctgttttatt ccagggacct gaagattttt '600 

ccatacctcc accaccacct ccaccttctt cccagttatc acctggatct tgtctctttt 660 

cattatcgtc ttcttccttt tcactg 686 



<210> 23 

<211> 1528 

<212> DNA 

<213> Homo sapiens 



<400> 23 

gagatgaaga tgctaaggaa acccagatag cttccaagaa gaagctcagg gagataagag 60 

agaaggattg ggtggtgatc caggcactat ggtggggaaa ccggagacaa gtagcaagtg 120 

aaatttatag agaatttgag atactatgtt gggagaatca gaactggatt ggaggtgaac 180 

atagagcaga aattcaggca tcagagaaga gagaccacag aacgtggatg gatgtgagga 240 

tggcacatat attctgtctc ccgaggctga gatccaggaa caattaaaag gagaaactga 300 

tgfcggatact caaagcaatg acccacttag agaagaggat ggtacagaca ttcagtcact 360 

agggaggaga gaggttaaag gtgaggatga taaagacacc caggaacttg ggaggaaaaa 420 

tcagggtcag ttaggaaatg aatttagtgg aaagattcac ataccaaagg ggaagaatca 480 

ggaacatatt agaggcgaag atggtgcaca tacccagata tctgagtcag ggaactgggg 540 

caaattaaca agtcaaattg atggagaaat gcattcagca gaatggaaga aagatcagca 600 

gattggaggt gagaatgggg cagaaattca gatacaaggg aagagaaacc tgagagaagt 660 

tggaggtgag gacggtgtaa agacctgggc acctgggaaa gaaacccaga gtcagtttag 720 

aagtgatctt ggtagaaaga tccttttatc agagtggaag agccagaagc agatgggaag 78 0 

tgagaatgga acagaaattc aggctccagt ggagagaaac cagagagaac ctggaggtga 840 

ggacggtgta aagactcaga gacctaagag agagaacgag gaccagttag atagtgaaat 900 

tggagggagc cattcaccag ggaggaggaa ctgggagctg attggaaagg atgttgcaga 960 

aaatcaggca tcagagaaga gaaaccagag agaggttgga aacgaggacg agtggaagaa 1020 

ccaggaacag ggtggaggtg ggaatgatga agaaattcaa atacagggga agagaaacct 1080 
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gagagggacc acagctgatg atggtacaga gacccaggct cctgcaggag atgaccaggg 1140 

acagttaaga gttgaaattg ctgaagagat ccaggtacaa gggcaaggaa ataagaatga 1200 

cggtggagtt gaggatgttg cagaactcca ggatatagga agccagagaa agtgcacaga 1260 

tgaggatgtt ggagagcctc gagcaccaag gggaggaaac aaggatctgg tcagaggaga 1320 

ggatgctgtg agggacagtc tccaagtcga ctgttctggg agtgagaggc ccacaggcag 1380 

gaagcacagc ctaccatggc ctccagcctt cactggctat ggatgtggga cccgggaaca 1440 

ggaacaggca gtggctgtga atggtttcat ctctgccccc tgtcctgaga tgaatcctgt 1500 

cccccactgg ggtgaagtct tcctgctg 1528 



<210> 24 
<211> 7268 
<212> DNA 

<213> Homo sapiens 



<400> 24 

tttcgtctcc gcctcgcctg ccaccggggt ttgtatgaaa acaccgggcg gcgggcggcg 60 

agggatccgc cgtgatccag gtgctgagcc gggtgctgcg gctctgcgcg gtccccgcca 120 

gcgccccatc ctgagccgat tatctgcaat tatgaaatga agtaactcaa gatgagcaag 180 

ttaaaagtga taccagaaaa aagccttacc aataattcta ggatcgtagg actcctggct 240 

caactggaga agatcaatgc tgagccttca gaatcagaca ctgcccgata tgttacatca 3 00 

aaaattcttc atctggctca gagtcaagaa aaaacaagga gagaaatgac agccaaaggt 360 

tctacaggaa tggaaattct gctgtcaaca ttagagaaca caaaagatct tcaaactaca 420 

ctbaatatct taagcattct tgttgagctg gtgtcagctg gtggaggtcg aagagtgagt 480 

ttcttagtca ccaaaggtgg ttcacaaata ttgttgcagt tacttatgaa tgccagcaaa 540 

gaatctcccc cacatgagga cttaatggta cagattcatt ctattcttgc aaagattgga 600 

ccaaaagata aaaaatttgg agtaaaggct agaattaatg gggctctgaa -tataaccctg 660 

aatttggtca agcagaattt gcagaatcat cgcttggttc taccttgcct tcagctttta 720 

cgagtatatt ctgccaactc tgtgaattca gtatccttag ggaaaaatgg agttgtggaa 780 

ctgatgttta aaatcattgg accatttagt aagaagaatt ccagtcttat aaaggttgct 840 

ttagacactc ttgctgcatt gctaaaatca aaaacaaatg ccaggagagc tgtagacaga 900 

ggatatgtcc aagtgctttt aacaatttat gtagattggc accgccatga taaccggcat 960 

agaaacatgc tcattcggaa aggaatttta cggagtttaa acaagcgtct acaaacatca 1020 

tgttgggaag aaaagcattt attgatgcca atgggatgaa aattctgtat aatacttcgc 1080 

aattgcctgt tattcctgtg actggtcctg tggctcagct ctacagctta cctcctgaag 1140 

tggatgacgt agtagatgaa agtgatgaca acgatgatat tgatgtagaa gctgaaaacg 1200 

aaactgagaa tgaagatgac ctagatcaaa attttaagaa tgatgatatt gaaacagata 1260 

ttaacaaact aaaaccccag caagaaccgg gacgaacaat agaagatcta aaaatgtatg 1320 

aacacctttt ccctgagctt gttgatgatt ttcaggacta tgatttaatc tccaaagaac 1380 

caaagccttt tgtatttgag ggaaaagtac gtggtcctat tgttgttcct acggcaggcg 1440 

agggaacatc tgggaattct ggcaatttaa gaaaagttgt aatgaaggag aacatatctt 1500 

ctaaaggaga tgaaggtgaa aagaagtcta cctttatgga tctagcaaaa gaagatatta 1560 

aagataatga tagaacatta caacagcagc caggtgatca aaatagaact atttcatcag 1620 

tccatggttt aaacaatgat attgtaaagg ccttggaccg aattacattg cagaatattc 1680 

cttctcaaac agccccaggt tttactgcag aaatgaagaa ggactgcagt cttcctctta 1740 

ctgtccttac ctgtgctaaa gcatgtccac acatggctac ttgtggaaaa bgttctgttt 1800 

cgagggaaga acagttcagc tagggaagct ttgctgcact ggagttgaaa ccgaagatga 1860 

tgaacgatac tgagtcaaat tcatcggtag aacaagcatc ggttgaagta cctgagtgga 1920 

ccaacactcc atgacccaga cctctatatt gagattgtga aacatacgaa gtctgtccca 1980 

gaatattccg aggtggctta tcccgattat tgtggtcaca ttccgcctcc attcaaagag 2 040 

cccattttag aaaggcctaa atggtgtacc aaaggacaaa aattgctcaa ggatattgga 2100 

aaggctaata acatcagagg tgatatcata gatcgtgtgg tatatgactt ggctaacccc 2160 

aaattacacc attccagaag agggagatct cttgaaattt aactccaaat tagagtctgg 2220 

gaatctgcgc aaagtaattc aaaattagaa aaaatgaata tgatcttatt ctgaactcag 2280 

acataaacag ccaatcatta tcatcagtgg ttttactttg aagaaagtgg aaccgcgacc 2340 

aggtggtgct tacaggctta acatcattaa ctgtgaaaag gccaacagtc agtttaatta 2400 

tgttatgcat cccactcatg tattcggttc aggaagcatt aaatgccata ccatggtgga 2460 

ttcgtatggt gactgacatt tgttactata aaaatcattt ctcaagaagt tcagttgctg 2520 

caggtgttcc aaagggaaaa tcctactata caattacatt tactgtcaat tttccacata 2580 

tagatgatgt ttgctactat gcttatcact atccatatac gtattcaact ttacagatgc 2640 

atcttcaata attggaatca gcacacaatc ctcagcaaat ctattttcgg aaagatgtgt 2700 

tatgtgaaac ccttgtctgg aaacagctgc cccttggfcga ctataacagc aatgccagag 2760 
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tctaattatt atgaacatat ctgccatttc agaaatcgcc cttacgtttt cttgtctgct 2820 

cgggtacatc ctgggagaaa ctaatgcaag ttggggttat gaaagggaac gttgggaata 2880 

tctccatgag gcaattaacc ccactggctt caggagctta cggaggaatc tttattattt 2940 

tttaaaattg tccctatgtt aaatccagat ggtgtcatca atggaaatca tcgctgttct 3000 

ttaagtggag aggatttgaa taggcagtgg caaagtccaa gtccggattt acatcctaca 3060 

atttaccatg ctaaggggct gttgcaatac ttggctgcag tgaagcgttt acccttggtt 3120 

tattgtgatt accatggcca ttcccgaaag aagaatgtat ttatgtatgg ttgcagcatc 3180 

aaagagacag tgtggcatac caatgataat tcgccttcat gtgatgttgt ggaggatacg 3240 

ggatacagga cattgcctaa gatactgagc catatcgccc cagcattttg catgagcagc 33 00 

tgtagcttcg tagtggaaaa atctaaagaa tccacagcac gtgttgtagt ttggagggaa 3360 

ataggagtac aaagaagtta taccatggag agtactttat gtggctgtga tcagggaaaa 3420 

tacaagggtt tacagattgg tacccgagaa ctggaagaga tgggagcaaa attttgtgtt 3480 

ggtcttttac agtttgaaaa gactgacctc tccattggaa gaataatctg ccctcccagc 3540 

cctgcttgac tttgaaaatg atttaaattg aatccaagcc tgcaaagtaa ctagccccta 3600 

cccacttatg tccttggatg aagatgaacc ctcgattccc ttgaagaagt tgattacagt 3660 

gcagaaagta atgatgagtt agatattgag ttggctgaaa atgtaggaga ttatgaacct. 3720 

tctgctcaag aagaagtact ttctgactct gaattatcaa gaacatacct accttgagcc 3780 

cgctcgccat ctctttgtta actgcaaaga ataaatgaaa tatcttggtg ttttattctc 3840 

ccaggaagct tgagagaaat gagtttatac agagctgact caaaaagaca aaaagtaact 3900 

tgggccagtt tggtttcaag ataataaatg tgttattaat taatgattaa aattggcgcc 3960 

tggttttatt ttcggatatt ccaatgcact ttatggtagc cattgaatga tccaaatatt 4020 

ggatttaccc tttaaaaaaa aaaaacctga gtatcattgc atgaattttt atctccctat 4080 

ggttatatcc tgcatcaagg gggataattt tgaagtgtgt tcagaatata aaattgaaat 4140 

tttagagttg ttgaaaatcc tgacttgttg aaaactaata tatatgtaca tggatttcta 42 00 

tagatgtgtt tgtttagaag tgggtagata ttgcagataa gactgttctt cagaatcatg 4260 

ttaactattg ggttgtgact gaagtagtcc agggtttgcc ttgaaaccat tacattctac 4320 

atttaccaaa ttaaacaaat aaaaactgta ttaaatgttg cattcatttt gtcatcttct 4380 

ttaaccagct cagattattt gatgtataga actttgtgag aatgtgataa aaacccagaa 4440 

ttggacacag tgataaaaag ttgtttttaa gaaagtgctg ggattacagg cgtgcgccac 4500 

cgtgcctggc taaatccttt cttgtaatga aacctacctg ggttttagcc ctcttgctgc 456 0 

aagaatgact ttatttttaa ataaaataaa gcaaatccat tgactttgga taagtgtttt . 4620 

aagaactaag tttctgatat aataaaacaa cctccaaaag atacctcatt gcagattttc ■ 4680 

gccaggattc aggtgcttag gttttggcac atatcttcct cactttttcc tatagtgttt 4740 

ttaacctgtt ataattggaa tgatagagat gtatttcaga attttgcatg gtccaaatca 4800 

atgatttttc tttttaatat gtaaagtatt tcattaacag tgtagattct ttctgatgtt . 4860 

tggggagcca tgtttaaaaa tgtagtatgg agcaattgaa aaaggggtct ttttccctag 4920 

cctaattctt actaatctca gaaaacaaag atcaaataga ctgtgaagtt gaactagtcc . 4980 

tcttagtgta gtaaccaaat ttaggaaaga tcaggataat ttaggaagga aagtaatcat •. 5040 

ttatttaatt atttataatt ataaacaatt attttatgtt ttgtatattt tatgtataaa 5100 

acaattacat gttttattat agttatactt cttcagagag tagattatgg agccaccatg 5160 

tccatacagt tagttgttcc ttggtattct tggattgatt ccaggacccc ccacagttac 5220 

ccaaatccac agatgctgaa gtctctgata taaaatggag tagtatttgc atttaaccta 5280 

tggacatcct cctgtatgct ttaaatcatc tctagattac ttataatacc taatacaata 5340 

taaatataaa tgctctgtaa atagttttat tgtatagtgt gtgtgtgtgt tttaagagac 5400 

gaggactcac aatgggacac tttgaccagg ctggtctcga actcctgggc ctacacaatc 5460 

ctcccacctc ggcgtcgcaa gtagctggga ctacaggtac tcaccactat gcctggcagt 5520 

tttttaaatt tgcgttttta aaaattgttt attatttatt gaatattttc tacctgagat 5580 

tgacttaatc tgtggatatg gaacctatag atatggaggg ctggctgtat atgaatttgt 5640 

ttttggcttt tacaaaatta gaaattaaaa gtaactacag ttactataat gtgcagaatg 5700 

actcaaatat ttccatttat tatgttagtg taaaaactcc taactacgta ttataatatt 576 0 

ttatattaat gaagtacttt agcatacatt atgagcctga ataaagcttt ttattaaata 5820 

ttcacaagat tgaaggagaa attctttata gttaggaaat gattttttca taaacaataa 5880 

atagctttga aaatataaaa aaaaatttag ggctgggtgc agaggctcat gcctgtagtc 5940 

ccagcactct gagaggccga ggcgggtgga tcacttgagg tgaggagttc aagatcagca 6000 

tggccaacat ggtgaaactc cacctccacg aaaaatacaa aaattagcca ggcatggtgg 6060 

catgcgcctg taatcctagc tacttgggtg gctgaggcat gagactcact tgaacctggg 6120 

aggcagaggt tgcagcagtg agccgagatc aagccgctgc actccagctg gggcgacaaa 6180 

gcaagactgt ctcaaaaaat aaaaacaaaa aaattaaaat aaaatcctaa ctttcctagt 6240 

tagaaatata attaactttt attggctatt agtagctaaa attgccctta tttttgcttc 6300 

tgtggcctca ttttttttct cggttattca gtatttaaat tatggcttag atgtcttcaa 6360 

tgctatagtc ctgaaaatta atgatctacg ttgtgcagca gtgatgggat ctgataaact 6420 

gagtctgatg atggagttta gatttatttc ctttaccttg catttggaac tttgaatctc 6480 

ttaaatgtgg ttttggtcta gtccactggt tctcacgttg taatgtgcat cagaatcttc 5540 

tgaagacaca gattgctgat tatgtagacc tggggtggag gtccaagaat gtgcattcct 6600 
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tgcaaattcc caggtggtga tgctgcagag ccacactttg agaatcacta gtataatcca 6660 

gtatgctctt ccagcatcac accttccttc ctgagaatca caggagttgt gaactgcaga 6720 

ttagcattgg ggagaattta gatcaaacta atttgtagaa tcaaggaggt caagtaaggt 6780 

cacaggggca cttgggttga gccagggttt tagcccaggt cttctgacaa ctgcctcatg 6840 

tccttaccac aaaggagctg ctatcctttg cctttcccca aagagtgaag actgcttaaa 6900 

gctcaaggat ctttcttgaa tttgtgaaat ttgttcaggc aaggtgaaaa gcaaaaacct 6960 

atggttcaca ttgacttttt gtattgatca ttgtcttttg aagacaggaa gtatgatcag 7020 

tctctgccac ttgtgctagt ttttgtgtgg tgtttagaaa catgggcatt tgtctggatc 7080 

ctaattacaa ataagtaacc tagaattctc ttcgagatag tggactaacc accgatgaat 7140 

ctattctaaa atttcaaata tccaaattaa aatggactgt attaggcatt aaggtactgg 7200 

aaatgggata atacattaaa tgtattaatg gaattgtttt tgtgcatgat acagaaataa 7260 

atgatagt 7268 



<210> 25 

<21L> 3378 

<212> DNA 

<213> Homo sapiens 



<400> 25 

atggatccgt ttacggagaa actgctggag cgaacccgtg ccaggcgaga gaatcttcag 60" 

agaaaaatgg ctgagaggcc cacagcagct ccaaggtcta tgactcatgc taagcgagct 120 

agacagccac tttcagaagc aagtaaccag cagcccttct ctggtggtga agagaaatct 180 

tgttcaaaac catcgccatc aaaaaaacgc tgttctgaca acactgaagt agaagtttct 24 0 

aacttggaaa ataaacaacc agttgagtcg acatctgcaa aatcttgttc tccaagtcct 300 

gtgtctcctc aggtgcagcc acaagcagca gataccatca gtgattctgt tgctgtcccg 360 

gcatcactgc tgggcatgag gagagggctg aactcaagat tggaagcaac tgcagcctcc 420 

tcagttaaaa cacgtatgca aaaacttgca gagcaacggc gccgttggga taatgatgat 48 0 

atgacagatg acattcctga aagctcactc ttctcaccaa tgccatcaga ggaaaaggct 540 

gcttcccctc ccaaacctct gctttcaaat gcctcggcaa ctccagttgg cagaaggggc 600 

cgtctggcca atcttgccgc aactatttgc tcctgggaag atgatgtaaa tcactcattt 660 

gcaaaacaaa acagtgtaca agaacagcct ggtaccgctt gtttatccaa attttcctct 720 

gcaagtggag catctgctag gatcaatagc agcagtgtta agcaggaagc tacattctgt 780 

tcccaaaggg atggcgatgc ctctttgaat aaagccctgt cctcaagtgc tgatgatgcg 840 

tctttggtta atgcctcaat ttccagctct gtgaaagcta cttcttctcc agtgaaatct 900 

actacatcta tcactgatgc taaaagttgt gagggacaaa atcctgagct acttccaaaa 960 

actcctatta gtcctctgaa aacgggggta tcgaaaccaa ttgtgaagtc aactttatcc 1020 

cagacagttc catccaaggg agaattaagt agagaaattt gtctgcaatc tcaatctaaa 1080 

gacaaatcta cgacaccagg aggaacagga attaagcctt tcctggaacg ctttggagag 1140 

cgttgtcaag aacatagcaa agaaagtcca gctcgtagca caccccacag aacccccatt 1200 

attactccaa atacaaaggc catccaagaa agattattca agcaagacac atcttcatct 1260 

actacccatt tagcacaaca gctcaagcag gaacgtcaaa aagaactagc atgtcttcgt 1320 

ggccgatttg acaagggcaa tatatggagt gcagaaaaag gcggaaactc aaaaagcaaa 1380 

caactagaaa ccaaacagga aactcactgt cagagcactc ccctcaaaaa acaccaaggt 1440 

gtttcaaaaa ctcagtcact tccagtaaca gaaaaggtga ccgaaaacca gataccagcc 1500 

aaaaattcca gtacagaacc taaaggtttc actgaatgcg aaatgacgaa atctagccct 1560 

ttgaaaataa cattgttttt agaagaggac aaatccttaa aagtaacatc agacccaaag 1620 

gttgagcaga aaattgaagt gatacgtgaa attgagatga gtgtggatga tgatgatatc 1680 

aatagttcga aagtaattaa tgacctcttc agtgatgtcc tagaggaagg tgaactagat 1740 

atggagaaga gccaagagga gatggatcaa gcattagcag aaagcagcga agaacaggaa 1800 

gatgcactga atatctcctc aatgtcttta cttgcaccat tggcacaaac agttggtgtg 1B60 

gtaagtccag agagtttagt gtccacacct agactggaat tgaaagacac cagcagaagt 1920 

gatgaaagtc caaaaccagg aaaattccaa agaactcgtg tccctcgagc tgaatctggt 1980 

gatagccttg gttctgaaga tcgtgatctt ctttacagca ttgatgcata tagatctcaa 2 040 

agattcaaag aaacagaacg tccatcaata aagcaggtga ttgttcggaa ggaagatgtt 2100 

acttcaaaac tggatgaaaa aaataatgcc tttccttgtc aagttaatat caaacagaaa 2160 

atgcaggaac tcaataacga aataaatatg caacagacag tgatctatca agctagccag 2220 

gctcttaact gctgtgttga tgaagaacat ggaaaagggt ccctagaaga agctgaagca 2280 

gaaagacttc ttctaattgc aactgggaag agaacacttt tgattgatga attgaataaa 2340 

ttgaagaacg aaggacctca gaggaagaat aaggctagtc cccaaagtga atttatgcca 2400 

tccaaaggat cagttacttt gtcagaaatc cgcttgcctc taaaagcaga ttttgtctgc 2460 

agtacggttc agaaaccaga tgcagcaaat tactattact taattatact aaaagcagga 2520 
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gctgaaaata tggtagccac accattagca 
ctgacattca ctactacatt tactctgcaa 
gaagtttaca gcttggtgca aaagaaagat 
tccaagtcca aggctattac tccaaagcga 
attcattctt cagtcatggc cagtccagga 
gcccttgttg gatctfcacac attatcattg 
gacaaggtcc cctttttatc ttctttggaa 
gtgaattcca gtgttgaaga aagaggtttt 
ggtgcctggc atcgaagatg gtgtgttctt 
ccagatgatg agaaacgcaa gaatcccata 
cgtcagatag aaccagccaa cagagaattt 
actgtccgac cacaaagaga agatgaccga 
ctctgtgtta ccaagaactg gctgtctgca 
caaaaactca atcaagttct tgttgatatt 
cctattggaa agccttaa 



agtacttcaa actctcttaa cggtgatgct 2580 

gatgtatcca atgactttga aataaatatt 2640 

ccctcaggcc ttgataagaa gaaaaaaaca 2700 

ctcctcacat ctataaccac aaaaagcaac 2760 

ggtcttagtg ctgtgcgaac cagcaacttc 2820 

tcttcagtag gaaatactaa gtttgttctg 2880 

ggtcatattt atttaaaaat aaaatgtcaa 2940 

ctaaccatat ttgaagatgt tagtggtttt 3000 

tctggaaact gtatatctta ttggacttat 3060 

ggaaggataa atctggctaa ttgtaccagt 3120 

tgtgcaagac gcaacacttt tgaattaatt 3180 

gagactcttg tcagccaatg cagggacaca 3240 

gatactaaag aagagcggga tctctggatg 3300 

cgcctctggc aacctgatgc ttgctacaaa 3360 

3378 



<210> 26 

<211> 3688 

<212> DNA 

<213> Homo sapiens 



<400> 26 

aggtaggggg agtaatagga gaagcfcacga 
atcaagcagg agcagtttgc gtctatagaa 
gcattcgaga ttatgatttg gcaataatac 
aatttgtcaa agacttgcgt actgaggcat 
gtaaaaattt agtcaccact aactttgaaa 
aaaaaaatca aacatgaaga ttgcagtttt 
aactgacttt ggaaaaaatg aagaaattcc 
aaggttgagg aaaagttcaa cctcacacaa 
gcaaacaaca gtttttacac cagatagcaa 
tggaggaaaa gtttgaatcc ttttcaagtt 
taccaggaaa gaagggacac ttgttgggaa 
ggggcggctt aagccctgca ctacctgcct 
aaccatgtgc catccccaga ggtgccccca 
ggtctgctcc gctactgtct cctattctct 
tgaattttct ggtgattctt cagaacaaag 
gccacctcct caggtgggaa tggaccgaat 
ggatgaagaa gtgaaagaag aagatacaga 
tcaggggcaa ctttacagtg agggggacag 
agaggagagg aggctggcac accagcaaca 
ggaggaggag ggtgaggagg gtgaggagga 
agatatgttc cgaatgccct ctcgatcccc 
cctgccaagc gggtgctctc tgtcctacag 
tgcaaatacc accgctgaca gcaccacaga 
tccatctgcc tccatctcca gatgaagcat 
tgatctgagt aaaaataata tcactttctt 
tctgaagaag ttaatgcgtt tgaatatgga 
attgccatct acattagaag aacttaaagt 
aaaaaggtta ttcaaactta aatcagttgg 
gtgaagccaa tgtcaatcct ttagctttca 
tgggaaaaaa taaatttaga attataccgc 
acctagaaaa taaccaaatt gaagaaataa 
tcaatgtcat tgtactacgt tataacaaaa 
ggataaatca agaaaatcta gaatccattg 
ccgtcctatc tacccaagtc cttgctgcac 
atccctggct atgtgtttgg ccacatggaa 
aacaagactt gctgatgatg gcatggaccc 
tgagagaatt atttctggga tcacaatgga 
gaaatgaaag cactacattt tctgaggctg 
agaagaaatt tgcaatgctg aagaggatga 



gagcgggaca cggtcaggta gatgtggaag 60 

aaaatcaaaa gtcatttaga tctgatagaa 120 

tatagcatat agtccatatg tcataagtgt 18 0 

acaagcaaat atataagcca attttcagaa 240 

tattgtttca ttttagaaat tggctggtgg 300- 

gttttgtttt tttctgctta tcatttttca 360 

taggaagcaa aggaggaaga tctaccacag 420 

gcacagatca aacagacagc ttggaattca 480 

gacttcctac tgttaacttt gattatagca 540 

ttcctggagt agaatcaagc tataatgtgt 600 

aagggcttaa ccatgtacaa caaagctggg 660 

ctgctcagat ggaagagttc tttgtgatga 720 

aacagttata cctgaagggg aatgctgccc 780 

actcagtggt atagcattaa atgatagaaa 840 

agaacctacc aatttacttc ataagcaact 900 

agtaagaaaa gaagcacttc aatctgagga 960 

gcaaaagaga gagacccctg aatctagaaa 1020 

cagaggagga gacagaaagc agaggcctgg 1080 

acgccaagga agggaggagg aggaggatga 1140 

tgaggaggac gaggaggacc cggtaagagg 1200 

gcttcctgct cctcccagag gcacactgcg 1260 

gaccatcagc tgcatcaacg ccaagcttac 1320 

taacaagtct gggagcctca cttggcaata 1380 

ttatatggat taccaaattt ggaaagggct 1440 

tcaggcatag gtccaaaagc attcaagcgt 1500 

tggaaataat ttgatacaga ttccttcaca 1560 

caatgagaac aatcttcagg ctatcgatga 1620 

tcaccttaga attggaagga aacaatctca 1680 

aacctttgaa gagcctagcc tacttgcgtc 1740 

agggtcttcc tggttctatt gaggaattat 1800 

ctgaaatttg tttcaatcat accagaaaga 1860 

ttgaagaaaa taggattgct cctttagcct 1920 

atctctccta caacaagctc tatcacgtcc 1980 

ctagtactcc ttgggaacca gattgaacgg 2040 

ccaggcctgg aatacttgta cctgtcattt 2100 

gtgtctcctt ctatggggca tatcattctc 2160 

cttaaaatct ataccacctg gggataccaa 2220 

aacaacaaca agatacggga acattcttcc 2280 

tgactcaaat ctgggaacat cttcatcttg 2340 
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aaaacaattt atattaaaat taggggaaat accatcttac acattttcca tgcataagat 24 00 

catactccaa gtatcgttct taaaccacaa aacatcaagt aattccaagt tttcctttgc 2460 

tgtttataaa ctttactcat gtatttgtag tagctgcatt tgtcattaat aagagagaca 2520 

taatcctcct gttatactca gtatcattat atgctagtca acctgattca ctaacacaca 2580 

gatgaacaac caaaatatac ctaaaaggta tagtctctag gagttttatt aatagtaaag 2640 

gtaaaatctc tcagtttcct acctctagaa agaggccatc tcactagaat aggatattat 27 00 

gcatactgag ctagaccaga agagtctgga acaaaataaa cacagccttt ataatcaact 2760 

tgaatactgg tgttagctga gaactctgta agtcccttta aaaattatgt atcttttggt 2820 

tcaagattaa gaagcataat gacaacaaaa aaagcaggca gaatttatga ataagtgttg 28 80 

tttattatta aaacaataat ttgttaattt cttataaggt cctgtgctat aattactggt 2940 

ataaatataa ctgaatattg gggtagcttt catttcttcc attaattaca tgtgtaaaat 30 00 

taaaacacta ctgtaatgtt aatttcctgg ttttggaaaa tcatattatg gttatgtaca 3060 

ttgttaacat taaggaaagc tggcagaagg gtgtgcataa attctatact ctttttgaaa 3120 

cttttctata attctaaaat tattttttaa agttaaacag tatattaaga ttaccttcac 3180 

tattcctcac tcaagattaa gacatttttt gaaaagcagt agagtttgct taaaatacaa 3240 

attaattatt cttgactata accttgtaaa ggtaaatcta atgtataaat ttttgaaaaa 33 00 

ttttgcacca ctggtcatag catctatctc ctttgcctta atttactgaa atacatcatt 3360 

ttattcggtt caattgaaat aaagctatgt ctttactatg tattggccct accaaaatca 3420 

tatattaaaa ttttctaaca taatgatttt tatctttgta ttgctaaagt aatttataag 3480 

acacatatag aaactatgtt gacttctttt gttactctca gttttatgtc ttggctagga 3540 

acttaagtgc cagtgttggc ctcattgcat aggtttctct aattttttaa gctttagtta 3600 

tttaatcatt taatagttct ttcaccactc taggaacctt ccatccccac ttcacaaagc 3660 

tattataaga actagaaaag aatgtaaa 3688 



<210> 27 

<211> 580 

<212> DNA 

<213> Homo sapiens 



<400> 27 

tcatgcttta gtcagttccc actctagcct tccctgcatg gctctctgcc taggagtgtc 60 
ctccccattt cctgcttctt gaccatctag gtactcaggt cccagaattc tctgctcaaa : 12 0 
tgatggaagc ttgttcctgg gacctgggtt ctcattcctc aggtttgttc tctcatggac . 180 

agactgccct gtggcatgca tcattctttg gcctgaggat gtttctcttg caggtgggag 240 

agtgggaaga gtctggacgt ggcagctgag gtatctgcag atgtacagag cttctgagag 3 00 

tgcaaagtgg agcttgggag ggggagaaga gcagcaaaag ccttggctga gactgactca 360 

tcaaggactg ccatgtcccc cagaggaagt ctcaggaatc cagacattct catcagatcc 420 

ttgccttcca ggtggcttgg aagctcatgc tctaaagcaa gtcagtgcaa tcaactccag 480 

aactagacca aaagacaaaa tcctggagca aaaacatagg ccctcattgg atcaaggaaa 540 

acagctctca atgaaaaaag aaaagatccc tgttggtggg 580 



<210> 28 

<211> 737 

<212> DNA 

<213> Homo sapiens 



<400> 28 

atgagtaggt ggggagcggc ggtggggcag ggcgcactgc gagaagagca cttcgcgcat 60 

gcgcatatca ccgagcgaac gcgccgtgtt cgggaaggta ggagaaagag gcggagctcc 120 

ttgttaacga catccccaac ctctgctaac gcgcaggcgc acttcctgaa gctgaaggtc 180 

agcatagaca aagggcctca gaatcgcgca ggcgcaattg tgccctggtt cgccaagatg 24 0 

tcgttcccaa agtataagcc gtgcgagcct gcgcactctg ccgtgagacc ctcgacccag 3 00 

ccgaatacaa cat at etc eg gaaacccggc gggegcaage eggageggtt ggeccatagg 360 

agcccagctg caaacgagag tacactgett cagtacaacg atcccaacac gecgaggget 420 

catcgaaaat ccgtgccttg cttcgttggg ectatgeaag aacaataaat gtttattcct 480 

aatttcaagc cccactccta aaagctcact catgggagct tttgtgtggg attttgggee 540 

cctcatcttc attttattat attatcaaaa ctgagaggga taggaaagaa aaacttatcc 600 

aegggaggaa aattgggatc agagacattt tcacctactc atattaagtc tggcaatgat 660 
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gaccattatg aatcctgccc taaataaatc atctattaat cattaaaaaa aaaaaaagaa 720 
aaaaagaaaa aaagtcg 737 



<210> 29 

<211> 585 

<212> DNA 

<213> Homo sapiens 



<400> 29 

atgcagttcc gcagggcacc attcatgtat tctgtgcgaa tggagcttgc aggagttcca 60 

tgcgaattga catgttttct tccccaaggc atttgtttgt tgatggtacc agctgtcaaa 120 

aaccaagcaa gtggcagtgc tagaggtgcc acaaaagtgc gaaggaagtg tcaggctgga 180 

tgtcaaaatg aacaccttgg agaactggat gatggaacag acggtaaaaa tcagctaaac 240 

atcagagaaa atggaggaag aggtcaaaac tgtgaacagg aactagaaga aagtgtagca 300 

gaaaaagact tgtcacaaac ttcgagagat ttggagaaaa tgatgtcaaa acacatcttc 360 

ctcaagccca tgctgagtat ctctgatttg gttaatttct tgatgcaagt ttccaaagtt 420 

ctggtgaaaa cggctgaagg aatagtgtta cagcaactgc ccttggcttt tccagcttta 480 

cacttccatg cctatggaaa tctcttccct gtctgtagct ttaagcatta catctatatg 540 

atcgaccatc ccatcttcat ctccatccct gacttcctaa cttga 585 



<210> 30 

<211> 2610 

<212> DNA 

<213> Homo sapiens 



<400> 30 

ctctgcagaa tccccgggtc gacgatttcg 
cgaaaccatt tagcatggtt accaggaatg 
agaaaacatt cagaatccaa aggaaagctt 
atcgtggaag tggttttcat cacaggagag 
catcatgaga atgcggagta tggcattcgc 
aggagatcca accgggcaga tgaattgacc 
ctggagagga atgccgccac cgctgggaac 
aatattggtc gttctcagag ctgggatgct 
gccgagtccc tgcggcctct ggggaggaga 
acttggtttg agccaaacca ccgaccacag 
tacctgtacc gggaggccgt ttataactca 
ttcaccttct acgacagcag acaggcagtg 
gagtactaca gtgatccgtc tggagctgct 
gacccaggag tcagccggcc ggtccccagc 
gatccaatgc aaggccggtc acctgccctg 
ggaggtaaaa tgatccctca ggggcggcag 
tatggacggg agcagcccga caccaggtat 
caggtcttca gcgacatcag cgaaagaccc 
ccgacgtgcc tggttgtgga ccccagttca 
gccccagggg ccctgaatcg tggctacggg 
gcttacgaga cttacgaagc tgacctgtcc 
gtgctccctg agttcctggc ctttctgcgg 
gccctactgc agcagggctt tgactccccg 
atcaagtctg tcgcacccaa cctgggccag 
tgcaggaccg aaatgcagct gcggaggcag 
tccagcagct tcagccaccg aagcgaactg 
gcggctcctc tgcagacagc atccccccga 
gcaccatctc agcacctcct ggagacggca 
catgccccac acttcccctc caactccggg 
gcgcgtctga gccccacgta ccccctgcag 
tcaaaccccc ttcacccagg ccccagaaca 
gagctgctaa agagggagcg caacgtggcc 



taggatcact taaatgctca ggtacagtat 60 

gagggtgagc cgactctgtt taaaatctgc 120 

gtaagtaagt acttctccat ggagtgctta 180 

aggattgcaa tttctaaatc agtctccctg 240 

aggactgaat ccctagattt taagtttgga 300 

ggtggtgaat attctgtagc attttcctcc 360 

cgtggactgg catttcccag caggcatata 420 

gctgggtggt acgagggccc ctgggagaac 480 

agctccctga cgtatggcac agcagagggg 540 

gacgctgccc tgcccgtggc cgccgagccc 600 

gtggctgcaa gaaaggggtc tactcctgac 660 

atgtctggtc gcagccccct gctgccacgg 720 

agggtaccca aagagcctcc cctctatcgg 780 

tacggagtgc ttggcagcag aacgtcatgg 840 

caggacgccg gtcacctgta ccgggatcct 900 

acacagagca gggctgcatc tcccgggcgg 960 

ggggcggagg tgcctgccta ccccctcagc 1020 

attgaccctg cccctgccag acaggtggcc 1080 

gctgctgccc ccgaaggcag cacaggggtg 1140 

cctgcccggg aaagcatccc atccaagatg 1200 

accttccagg gtcctggtgg caagaggacc 1260 

gcggaggggc tggcagaggc cacgctggga 1320 

gccgtcctgg ctaccctgga ggacgcggat 1380 

gcccgtgttc tgagccgcct ggccaatagc 1440 

gaccgggggg gcccgctgcc ccgggcgcgg 1500 

ctccatggtg acttggcttc tctgggcgct 1560 

gctggagacc cggctcgccg tccctccagc 1620 

gccacctatt ctgcccctgg cgtgggcacc 1680 

tacagctctc ccaccccttg cgccctgaca 1740 

gcaggggtgg ccttgactaa cccgggcccc 1800 

gcctactcca cggcatacac ggtgcccatg 1860 

gcctctccgc tgcccagccc tcacggcagc 1920 
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ccccaggtct tgcggaagcc aggtgcaccc 
cagagcctcc acacgcctca ctcaccgtac 
atcatcgtgt ccaccatgct tgcaccagaa 
agtgacaacg tgaggaaggc gtatgctgcg 
gacacagaca aatgggggct ccaggcccgc 
tcggctgtcg ctctcaggct gacctggccg 
ggtggctgcg agccagcaat ctccccgtgt 
cagatgcagc agggatcatc agaaacaacg 
tttgtcttca caaaatacag ccaagcctgc 
cctgtcacca gagctggtca ctctcatggc 
ctggaggtgt gaacgctgat ctcctaggta 
atcagaatct gtatagaggg tcgccgccat 



ctggggccat ccaccctgcc gccggccagc 1980 

cagaaggtgg cccggcgcac aggggcaccc 2040 

ccaatccagt ttgctggcca agcagtccag 2100 

gggaccccag tgcggcccac cagccccggt 2160 

gtgagtggca gcacctggca ggtggtgggg 2220 

gctatggcac aagtcgcgga gccctccggc 2280 

cacgtgctgt cacccgagcc ctgcctgcac 2340 

aacgaatggg gctgcggcca ttttcatatc 24 00 

agcctgcaca gggctcagct caggacacac 2460 

ttcttttcgt gtggcctcgg tttccagcaa 2520 

gaaagcacct ccatcaataa ttttagccct 2580 

2610 



<210> 31 

<211> 797 

<212> DNA 

<213> Homo sapiens 



<400> 31 

cccggaattt ccgggtcgac gatctcgtcg gcttctcgga agtaatttaa gtgcacaaga 60 

cattggtcaa aatggtttcc aaaagaagac tgtcaaaatc tgaggataaa gagagcctga 12 0 

cagaagatgc ctccaaaacc aggaagcaac cactttccaa aaagacaaag aaatctcata 180 

ttgctaatga agttgaagaa aatgacagca tctttgtaaa gcttcttaag atatcaggaa 240 

ttattcttaa aacgggagag agtcagaatc aactagctgt ggatcaaata gctttccaaa 300 

agaagctctt tcagaccctg aggagacacc cttcctatcc cacaataata gaagaatttg 360 

ttagtggcct ggagtcttac attgaggatg aagacagtct caggaactgc cttttgtctt 420 

gtgagcgtct gcaggatgag gaagccagta tgggtgcatc ttattctaag agtctcatca 480 

aactgcttct ggggattgac atactgcagc ctgccattat caaaacctta tttgagaagt 540 

tgccagaata tttttttgaa aacaagaaca gtgatgaaat caacatacct cgactcattg 600 

tcagtcaact aaaatggctt gacagagttg tggatggcaa ggacctcacc accaagatca 660 

tgcagctgat cagtattgct ccagagaacc tgcagcatga catcatcacc agcctacctg 720 

agatcctagg ggattcccag cacgctgatg tggggaaaga actcaggtgg ataaaccctc 780 

tgtcatcatc taagtga 797 



<210> 32 

<211> 5933 

<212> DNA 

<213> Homo sapiens 



<400> 32 

gacgcgcctg agccggagtt tcagataaca 
acacatggtg gagagaagac cttatctgga 
cagaggccct gtggatggag agttaccacc 
agccaatgct ctccgaggtg gagccagcca 
tcctgctgct tactgtcaga gggaagagag 
aggaagtgag gaacaggagt ggcatgtctg 
ggggaatgac accaggtgga gaaatggcaa 
gtccaatgac aacttccagc agtggcggac 
gcaggcgaag aaactgggct acaagttctt 
ggtggtcatc acacttgcca caagtttagg 
gaaatctaac ttccttgagc tcatctgtca 
ggatcgccag agtgttctcc atgtactggg 
ctgcctgcct gcttatgtgg tagggatgat 
gtatccagag cacataagca acatcatctc 
tgccagctct gtgcaggaaa cttccatgct 
tctgagagcc tctggtgttg acatagaaga 
gactatcatt gaacatctgc aggaaaagag 
cactctagtg caagcagagg aacgaggacg 



taaagtaacg tgtaggaatt caggtgactc 60 

tgccaggccc aggaattccc ataccaacca 120 

aagagctaga aatcaggcca ataacccacc 180 

ccctggaagt gatcctaggg ccaacaacca 240 

atttagggcc atgggctgga acccacatca 300 

cgacgaagct agagaccaaa gacattgcca 360 

ccaggactgt aggaaccgca gaccaccatg 420 

tccccaccag aagcctacag aacagccaca 480 

agaaagtctt ctgcagaaag acccttctga 540 

gctgaaagag ctcctttctc attcttccat 600 

ggttcttcgg aaggcttgta gctccaaaat 660 

catattgaaa aactccaaat ttctcaaagt 720 

cactgaaccc atccctgaca tccgaaacca 780 

cctcctccag gaccttgtaa gtgtcttccc 840 

ggtttccctc ctgccaacct ctcttaatgc 900 

ggaaacggag aagaacctgg aaaaggtaca 960 

gcgagagggc actttgagag tggataccta 1020 

catgttgaga gctacgctga ccatgcccag 1080 
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ataccctacc tacactgaag cgcacttggg tgaagaggcc cttccttcgc cccaatatca 1140 

tttctggaaa atacgacagc actgctatct atctggatac ccacttccgg cttctgggag 1200 

aagatttcgt cagaccttta cgggaaggta ttttggaact tctccaaagc tttgaagacc 1260 

agggcctgag gaagagaaag tttgatgaca tccgaatcta ctttgacacc aggattatca 1320 

cccccatgtg ttcatcatca ggcatagtct acaaggtgca gtttgacaca aaaccactga 1380 

agtttgttcg ctggcagaat tccaaacgat tgctctatgg gtctttggta tgcatgtcca 1440 

aggacaactt cgagacattt ctttttgcca ccgtatctaa cagggagcag gaagatctct 1500 
gccgaggaat tgtccagctc tgcttcaatg agcaaagcca acagctgcta gcagaggtcc • 1560 

agccctctga ctctttcctc atggtagaga caactgcata ctttgaggcc tacaggcacg 1620 

tcctggaagg actccaggag gtccaggagg aagatgttcc cttccagagg aatatcgtgg 1680 

agtgtaactc tcatgtgaag gagccaaggt acttgctaat ggggggcaga tatgacttta 1740 

cccccttaat agagaatcct tcagccactg gggaatttct aagaaatgtc gagggtttga 18 00 

gacatcccag aattaatgtc ttagatcctg gccagtggcc ctcaaaagaa gccctgaagc 1860 

tggatgactc ccagatggaa gccttgcagt ttgctctcac aagggaactg gctattattc 1920 

aaggacctcc tggaacaggc aaaacctatg tgggtctaaa aattgttcag gccctcctaa 1980 

ccaacgagtc tgtttggcaa attagcctcc agaagttccc catcttggtt gtgtgttata 2040 

ctaatcatgc tttggaccag tttctggaag gcatctacaa ttgtcagaag accagcattg 2100 

t 9 c Srsrg t 9'g > g tggaaggagc aacagtgaaa tcctgaagca gttcacccta agggagctga 2160 

ggaacaagcg ggaattccgc cgcaacctcc ccatgcacct ccgaagggcc tacatgagta 2220 

tcatgacaca gatgaaggag tcagagcaag agcttcatga aggagccaag accctggagt 2280 

gcaccatgcg tggtgtccta cgggaacagt acctgcagaa gtacatctca ctccagcact 2340 

gggaaagtct catgaatgga ccagtgcagg atagtgaatg gatttgcttc cagcactgga 2400 

agcattccat gatgctggag tggcttaggt cttggtgtcg gfctctttcac gcaaagtgtt 2460 

tctccagcag gacctgagaa tacagcccag gcagaagggg atgaggagga agaaggggag 2520 

gaggagagtt cgctgataga gatcgcagag gaagctgacc tgattcaagc agaccgggtg 2580 

attgaggagg aagaggtggt gaggccccag cggcggaaga aggaagagag tggagcagac 2640 

caggagttgg ctaaaatgct tctggccatg aggctagacc actgtggcac tgggacagca 2700 

gctggacagg agcaagccac aggagagtgg cagacccagc gcaaccagaa aaagaaaatg 2760 

aaaaaaagag tgaaggatga gcttcgcaaa ctgaacacca tgactgcagc cgaggccaac 2820 

gagatcgagg atgtttggca cctggacctc agttctcgct ggcagcttta taggctctgg 2880 

ctacagttgt accaggctga cacccgccgg aagatcctca gctatgaacg ccagtaccgc 2940 

acatcagcag aaagaatggc cgagctgaga ctccaggaag acctgcacat tcttaaagat 3000 

gcccaggttg taggaatgac aaccacaggt gctgccaaat accgccagat cctacagaag 3060 

gtggagccga ggattgtcat agtggaacaa gctgcggaag tccttgacgc ccataccatt 312 0 

gccacattga gcaaaagctt gccagcacct cattttgatt gcggaccacc agcagctgcg 3180 

ccccagtgcc aacgtgtatg atctggccaa gaacttcaac cttgaggtgt ccctttttga 3240 

acggctagtg aaagtaaaca ttccctttgt ccgtctgaat taccagcacc gtatgtgccc 33 00 

tgaaattgcc cgccttttga ccccccacat ttaccaggat ctggagaatc atccatctgt 3360 

tcttaagtat gagaagatta agggggtgtc ttccaacctt ttctttgtag aacacaactt 3420 

tcctgaacag gaaatccaag aggcgcaaaa gccatcagaa ccagcatgag gctcacaatg 3480 

tggtagagct gtgcaagtac ttcctgtgcc aggaatacct gccttcccag atcaccatcc 3540 

tcactaccta taccgggcag ctcttctgcc tgcgcaaact gatgcctgcc aagacatttg 3600 

ctggcgtcag ggtccatgtt gtggacaaat accaagggga agagaatgac atcatcctcc 3660 

tctcgctagt gcggagcaac caagaaggca aggtgggttt tctgcagata tccaaccgca 3720 

tctgtgtggc cttgtcccga gccaagaagg gaatgtactg catcggaaac atgcagatgc 3780 

tggccaaggt gcccctgtgg agcaagatca ttcatacact tcgagagaac aatcaaatag 3840 

gccccatgct ccggctctgc tgccagaacc accctgaaac ccacacctta gtatccaaag 3900 

cttctgactt ccaaaaagta cccgaaggag gctgcagcct gccctgcgag ttccgcctgg 3960 

gctgtgggca tgtctgcacc cgtgcctgcc acccttatga ctcttcacac aaggagttcc 4020 

aatgcatgaa gccatgccag aaggtcatct gtcaggaagg gcaccggtgt ccccttgttt 4080 

gcttccagga gtgtcagcct tgtcaggtga aggtgcccaa aaccattcct cggtgcggcc 4140 

atgaacaaat ggtcccttgt tccgtgcctg agtcagattt ctgctgccag gagccttgct 4200 

ccaagtctct gagatgtggg cacagatgca gccacccatg tggtgaggac tgtgtgcagc 4260 

tgtgttcaga aatggtcacc ataaaactca agtgtgggcc cagttaaacc ggtaaaatgt 4320 

ggttatgtgg aaggcctcct gtatggtggt gtggtagtca agtgtacaca caagtgtggc 4380 

actatcttgg actgcgggca tccttgccca ggctcctgcc acagctgctt cgaagggcgt 4440 

ttccatgaac gctgtcatca gccagtgcta gcgcctgctt atctgctcac acaagtgccc 4500 

agaaaccatg cattggtgag tgcccaccct gccagcggac ctgtcagaac cgctgtgtcc 4560 

acagccagtg caagaagaaa tgtgaggagc tgtgtagtcc ctgcgtggaa cccatgtgct 4620 

cgcgctgcca gcactaccag tgcaccaaac tctgctctga gccctgcaac cgacccccat 4680 

gctatgtgcc ttgtactaag ctgctagttt gtggccaccc ctgcattggt ctctgtgggg 4740 

agccatgccc caagaaatgc cggatctgcc acatggatga ggtcacccaa atattctttg 4800 

gctttgagga tgagcctgat gcccgctttg tgcagctgga agactgcagc cacaattttt 4860 

gaggtgcaag ccctagaccg ctacatgaat gaacagaagg atgatgaagt cgccatcaga 4920 
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ttgaaagtct gccctatctg ccaggtgccc atccgcaaaa acctgagtta tggaactagc 4980 

ataaaacagc ggctagagga gattgaaatc atcgaggaaa agtatccagg gctcatcagg 5040 

ggaaatggca accagccagg aacggcttaa ggccctgctg gagaggaaga gcctcctcca 5100 

ccagctgctt cctgaagact tcctgatgtt aaaggagaag ctggcccaga aaaatctgtc 5160 

agtgaaggac ctgggtctgg ttgagaatta catcagcttc tatgaccacc tggccagcct 522 0 

gtgggattcc ctgaaaaaga tgcatgtctt agaagagaaa agagtgagga ctcgactaga 5280 

acaggtccat gagtggctgg ccaagaagcg cttgagcttc actagccagg aactaagtga 5340 

cctccgaagt gaaatccaga ggctcacata cctggtgaac cttctgaccc gctacaagat 5400 

agcagagaag aaggtgaaag atagcatagc agtagaggtc tatagtgtcc agaatatcct 5460 

tgagaaaaca tgtaagttca cccaagagga tgaacaactt gtgcaggaaa agatggaagc 5520 

tctgaaagcc acccttccct gctctggcct gggcatctca gaggaagagc gagtgcagat 5580 

tgtcagtgcc ataggttatc ctcgtggtca ctggttcaag tgccgcaatg gccatatcta 5640 

tgtgattggc gattgtgggg gagccatgga gaggggcacg tgtcctgact gtaaggaagt 5700 

gattggtggc acaaatcata ctctggaaag aagcaaccag cttgcttctg aaatggatgg 5760 

agcccagcat gctgcctggt ctgacacggc caacaacctg atgaactttg aggagatcca 5820 

ggggatgatg taggaagatg gtacacccct gccttttgcc ctcgccactg aatgcctggg 5880 

gccagctccc taatgaagga actgaagttt gttttttatt atcatccttt tta 5933 



<210> 33 

<211> 2607 

<212> DNA 

<213> Homo sapiens 



<400> 33 

ggaattcgcc ggtcgcgcgc ccgccagcct gccgcctggg ctgggggtca cgaaaggaaa 60 

ccttacagaa acatgaagcc ctcaaccatc tgctactcag ttattcgggg ctgacggcgg 120 

cttctagaac atccaggtgt tctgcagatg cgagaactca tcctgtagtc accagatgga 180 

gtcccaaaca gccaagcaga tgtaaggcct gtgctgtggc tctgaggccc tgaatacaga 240 

agggtcactt tcttagtggc caaagagcag ttgttgacat tgatgtctaa ttattgaaca 3 00 

cgaccagtca ttttactgag ctgcggtgag gaaacactga cccatataag atcaagccca 360 

atgagggatt ggcaatttcc ctgattcttt tgaattaaga ttccagatgg gggcctcatt 420 

tctacagccc ccaacattcc tatagccgtt atcactgcca tcaccactgc caccagcatc 480 

ttcttgcaga ttccacccct gctccccaga gacttcctgc tttgaaagtg agcagaaagg 54 0 

aagctctcag aaaaatctct agtggtggct gccgtcgctc caggacaatc ggaatcctgc 600 

cttcaccacc atgggctggc tttttctaaa ggttttggtg gcgggagaga gtttctcagg 660 

acttctttat cctcttgtgg atttttgcat cagcgggaaa acaagaggac agaagccaaa 72 0 

ctttgtgatt attttggccg atgacatggg gtggggtgac ctgggagcaa actgggcaga 780 

aacaaaggac actgccaacc ttgataatat ggcttcggag ggaatgaggt ttgtggattt 840 

ccatgcagct gcctccacct gctcaccctc ccgggcttcc ttgctcaccg gccggcttgg 900 

ccttcgcaat ggagtcacac gcaactttgc agtcacttct gtgggaggcc ttccgctcaa 960 

cgagaccacc ttggcagagg tgctgcagca ggcgggttac gtcactggga taataggcaa 102 0 

atggcatctt ggacaccacg gctcttatca gccccgcgtt ccgtggtctt gattactact 108 0 

ttggaatccc atatagccat gatatgggct gtactgatac tccaggctgc aaccaccctc 1140 

cttgtccagc gtgtccacag ggtgatggac catcaaggaa ccttcaaaga gactgttaca 1200 

ctgacgtggc cctccctctt tatgaaaacc tcaacattgt ggagcagccg gtgaacttga 1260 

gcagccttgc ccagaagtat gctgagaaag caacccagtt catccagcgt gcaagcacca 132 0 

gcgggaggcc cttcctgctc tatgtggctc tggcccacat gcacgtgccc ttacctgtga 138 0 

ctcaactacc agcagcgcca cggggcagaa aagcctgtat ggtgcagggc tctgggagat 1440 

ggacagtctg gtgggccaga tcaaggacaa agttgaccac acagtgaagg aaaacacatt 1500 

cctctggttt acaggagaca atggcccgtg ggctcagaag tgtgagctag cgggcagtgt 1560 

gggtcccttc actggatttt ggcaaactcg tcaagggtga agttcagcca agcagacgac 1620 

ctgggaagga gggcaccggg tcccagcact ggcttactgg cctggcagag ttccagttaa 168 0 

tgtcaccagc actgccttgt taagcgtgct ggacattttt ccgactgtgg tagccctggc 174 0 

ccaggccagc ttacctcaag gacggcgctt tgatggtgtg gacgtctccg aggtgctctt 1800 

tggccggtca cagcctgggc acagggtgct gttccacccc aacagcgggg cagctggaga 1860 

gtttggagcc ctgcagactg tccgcctgga gcgttacaag gccttctaca ttaccggtgg 192 0 

agccagggcg tgtgatggga gcacggggcc tgagctgcag cataagtttc ctctgatttt 198 0 

caacctggaa gacgataccg cagaagctgt gcccctagaa agaggtggtg cggagtacca 2040 

ggctgtgctg cccgaggtca gaaaggttct tgcagacgtc ctccaagaca ttgccaacga 2100 

caacatctcc agcgcagatt acactcagga cccttcagta actccctgct gtaatcccta 2160 

ccaaattgcc tgccgctgtc aagccgcata acagaccaat ttttattcca cgaggaggag 2220 
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tacctggaaa ttaggcaagt ttgcttccaa atttcatttt taccctcttt acaaacacac 2280 

gctttagttt agtcttggag tttagttttg gagttagcct tgcatatccc ttctgtatcc 2340 

tgtccctcct ccacgccgac ccgagagcag ctgagctgcg ctggctctgg gcagggagtg 2400 

tgccttaatg ggaagcacac gggctttgga gtcaggcaca ggtgccagct ccagcttttg 2460 

aacttgggca attgtttaac ctaacctgca agttgatttt gagggttaaa taaaggcata 2520 

catgaaaatg cctggcaaat tacctgacac agagcagaca ttcaatacat tttagtttcc 2580 

ttgtttctct ggttcccagt ttctctg 2607 



<210> 34 

<211> 1120 

<212> DNA 

<213> Homo sapiens 



<400> 34 

cccgttgcta cgacgatttc gcagcccctg agtctggagg ctgacatgtg gagcataggc 60 

gtcatcacct acatcctctt aagtggagca tcccctttcc tgggagacac gaagcaggaa 120 

acactggcaa atatcacagc agtgagttac gactttgatg aggaattctt cagcgcagac 180 

gagcgagctg gcccaggact ttattcggaa gcttctgggt taagagaccc ggaaacgggt 240 

cacaatccaa gaggctctca gacacccctg gatcacgtca aaaggagaag gcagagcccc 300 

tgaacagcgg aagacagagc ccacccagct gaagaccaag cacctgagag agtacactct 360 

caaatgccac tcaagcatgc ctcccaacaa ctgctatgtc aacttcgagc gttttgcctg 420 

tgtggtggaa gacgtggctc gggtcgacct gggatgccgt gccctggtgg aggcccacga 480 

taccatccag gacgatgtgg aggccctggt ctccatcttc aatgagaagg aagcttggta 540 

ccgagaggag aatgagagcg cccggcatga cctgtcccag ctcaggtatg agttccgcaa 600 

ggtggagtca ttgaagaagc tgctgcgaga agacatccag gccacgggat gtagcctcgg 660 

aagcatggcc aggaaactgg accatctgca agcccagttt gagattctga ggcaggagct 72 0 

ctccgcagac ctgcagtgga ttcaggagct ggtgggcagc ttccagctgg agagtgggag 780 

ctctgagggc ctgggctcca cattctacca ggacaccagt gaatccctgt cagagctgct 840 

cagcagatca tgcactgagg aatttctggc aggctggaag ctctaatcgc cagaatccag 900 

tcagtgatgc atcccaagag gtttgcagaa gcatgttctc atccaccctg ctgcggcctc 960 

cagctagaat tatcctgcag gggtgtgtgt ctgtagtata gccttccagc ccctccacag 1020 

aatccagaat tgctgaggtg ccagttaatt cttcattctg cacctcccct caccaatctg 1080 

gttttctggg agggctctgg cagggcagcc aaactgtatt 1120 



<210> 35 

<211> 264 

<212> DNA 

<213> Homo sapiens 



<400> 35 

cttttggaat tctatttgca aagggtgtgg aggagaaagt gtagggtggt gttgctatct 60 

tgagagcatg ggaaataccc ttgaactgct gaaagcaggt acctatggtg ccagttttca 120 

gtgtggagaa ggatggagag gagctcggta gcttcaggcc acgctgggcc gattggctga 180 

ctggattact ggagtgggtt tcagtagaga gcctttctat ctattgcatt tctcaacctg 240 

tttatatgtg ggtggaataa ggct 264 



<210> 36 

<211> 836 

<212> DNA 

<213> Homo sapiens 

<400> 36 

gagccctccg tggcagtgct tgggtgtccg ggctccgaga ctccggcctg acctctccat 60 

ggggtcgaaa ggacggtctc tggatgccag gagtcgcgga gggcggacca gcatgaggaa 120 

accattggcg gaaaatggca gaagcagtgc ggcatcccag cctcaactgc cgggacggtg 180 
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ttcaagagac attggtgggg taaacataca aaaatgtgat tgcctcactc aaccaagagc 240 

cttagccata attaaaagat gttcagatgg agctgtacag gaatgtgatg ctggagaact 300 

agagcaacag agccacatta gcacaagcag gcccaccgcg gtctcacaca cactcgagcc 360 

ttcctttgcc cagtctcaca tgcatgactg ggatcgcgga tttcgtcccc ttccgactcc 420 

ccatgctggt tcggtcccag acgctcaggt gccacattgg ggtgcgctct cccggctcct 480 

ccacacgcta cgcagctgcc cgcctcaaga gcgtctgcgt gcttccagcg ttaagtggaa 540 

ttccaagcag cctccggggc acagttagtc tggtcactgc ggccagactg ccgcctcagg 600 

acccagagaa cgcagcacac tttagttggg cccatctatc cccggcgagg cttggactac 660 

gtttcccagg agaccctgcg gtccaaagag tggctccccg gggtggggag atgatttgga 720 

cccctgtggg ttggagccag ccccccactt ctccgaggct tcaggaacta caagcaaaac 780 

aaaggacgtg gggccccgca gcccaagcgg agccacagag tgaggagcat cagaag 836 



<210> 37 

<211> 2834 

<212> DNA 

<213> Homo sapiens 



<400> 37 

atgatcagcg cccctgacgt agtggccttc accaaagagg aagagtatga agaagagcct 60 

tacaatgagc cggccctgcc tgaggagtac tcggtgccgc tcttcccctt cgccaagtca 120 

aggtgctaac ccctggtcaa aaactgtccc gggccaagtt ttcgaaggac ttcattctta 180 

attccgaagt ctctgaacaa ggggggaacc ccacccttac tgaccaatcc caatggacac 240 

ccaaagtttt gggcactttt gaatctcaat taacttcttc cttggcgtat catgtctgtg 3 00 

gattaccagg cttccttcgt gggccatcct cctggatctg cctaccccaa gctgaacttc 360 

gtggaggact ccaaggtggt gctgggagat tctaaggagg gcgcctttgc atacgtgcac 420 

caccttaccc tatacgacct ggaggcccgt ggcttcgtga ggccgttttg catggcttat 480 

atctctgcag accagcataa aatcatgcag cagttccagg agctttcagc cgaattttcc 540 

agagcttctg agtgcttgaa gactggcaac aggaaggcat ttgctgggga acttgaaaaa 600 

aagctgaaag acttggatta caccaggaca gtgctacaca cagaaacgga gatccagaag 660 

aaagccaacg acaaaggctt ttactcatct caggcaattg agaaagccaa tgaactggcc " 72 0 

agtgtggaga agtccatcat tgaacatcaa gacctgctga agcagatccg ctcataccct 780 

catcggaagt tgaaggggca tgatttgtgt cctggtgaga tggagcacat ccaggatcag 840 

gccagccagg catccactac ctctaaccct gatgagtctg ccgacacaga cctttacacc 900 

tgcagaccag cctacacccc aaaacttatc aaagcaaagt ccaccaagtg ttttgacaag 960 

aagttgaaga ccttggagga gctctgtgac actgaatatt tcacccagac cctggctcag 1020 

ctcagccaca ttgaacacat gttcagagga gacctgtgtt acctcctgac cagtcagatt 1080 

gatagagcac ttctaaaaca acagcatata acaaactttc tctttgaaga ctttgtggag 1140 

gtcgatgaca ggatggtgga gaaacaagaa agcataccct ctaagcccag tcaagacagg 1200 

ccgccttcca gttctctaga agaatgccca attcctaaag tgttaattag tgttggttct 1260 

tacaagtcca gtgtggagtc tgtgttgatc aagatggagc aggaactggg agatgaggag 1320 

tacaaggaag tggaagtgac tgagttgagc agtttcgacc cccaggaaaa cttggactac 1380 

ctggatatgg atatgaaagg gagtatcagc agtggtgaaa gtattgaagt tttgggcacg 1440 

gagaaatcca cctccgtgct ttctaaatct gacagccagg caagcctcac agtaccattg 1500 

agcccccagg tggtccggag caaagcagtc agccacagga ccatcagcga ggacagtatt 1560 

gaagtcctca gtacctagcc cctctgaggc cctcatccat gatgacttta aggccagcta 1620 

cccaagtgcc attaatgaag aagaatcata tccagatggc aatgaacgag ccatccactt 1680 

ccaggcaagc atcagtcctc cagaactggg tgagacagag gaaggcagca tagaatacac 1740 

cccatcacaa atagactcct gctgctgtat tgggaaggag agcgatggtc agttggtgct 1800 

gccctccact ccagcccaca cacactctga cgaggatggt gtggtgagca gccccccaca 1860 

gcgccacagg cagaacgacc aggggttccg tgtagacttt tcagtggaaa atgccaaccc 1920 

ttcttcccga gacaacagtt gtgaagggtt tcccgcttat gagctggacc cgagccacct 1980 

gctggctagc cgggacatca gtaagaccag cctggacaac tactcagaca ccaccagcta 2040 

cgtgagcagt gtagcgtcca ccagctcaga caggatcccc tctgcttatc ctgctggcct 2100 

gtcttccgat aggcataaaa agagggctgg ccagaacgcc ttaaaattca tccgccagta 2160 

cccctttgcc cacccagcca tctactccct gctcagtggg aggacacttg tggtcctggg 2220 

ggaagatgag gccatagtca ggaaactcgt gactgcactg gctatctttg tccccagcta 2280 

tggctgctac gctaagcccg tgaaacattg ggcctcctcc cctttgcaca ttatggattt 2340 

tcagaagtgg aagcttattg gcttgcagag agtggcctcc cctgccggtg ccggtaccct 2400 

ccatgccctg agccgctaca gccgctacac gagcatcctg gaccttgaca acaaaaccct 2460 

gcgctgcccc ctttacagag gcaccctggt gccccgcctg gcagaccacc gcacacagat 2520 

caagcggggc agcacctact acctgcatgt ccagagcatg ctcacccagc tctgctccaa 2580 
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ggccttcctc tacaccttct gccaccacct gcacctgcct acccacgaca aggagacaga 2640 

ggagctggta gccagccgcc agatgagctt cctaaagctg accctgggtc tggtgaatga 2700 

ggatgttagg gtggtccagt acctggctga gctgctgaag ctgcactaca tgcaggaatc 2760 

tccagggacc agccacccca tgctcaggtt tgactatgtc cccagctttt tgtataaaat 2820 

ctgaggtcgg tccc 2834 



<210> 38 

<211> 909 

<212> DNA 

<213> Homo sapiens 



<400> 38 

gctaattgaa caagccccaa ttcaaatggg agaagaggca gtgaggtggg caaaactggt 6 0 

cataccttta gtggttcatt cagcacaaaa ggtacatttg cggggagcaa ctgctctgga 12 0 

gatgggaatg ccattattgc ttcagaaaca gcaagaaata gcatctatta cggagcagct 180 

tatgactact accttgcatc gaagtgggag tttcatcaat tctctcttgc aactagaaga 24 0 

acttggattt cgtagtggag cacccatgat taaaaagata gcttttattg cttggaagag 300 

tttaatagat aattttgctt taaatccaga tatactatgt agtgcaaaaa gactcaagtt 360 

gttaatgcag cctttgagtt ccatccatgt gagaacagaa actctagcat taacaaaact 420 

agaagtctgg tggtatttac tgatgagact tggacctcat cttcctgcta attttgaaca 480 

gggttgtgtg cctctgattc aaagtacaat aggcattgat tctaatgcct aacctcaggg 54 0 

caattcgtgt catgtagcta catctccagg tttaaatcct atgactcctg tacacaaagg 600 

tgcttcctcc ccgtacggag ccccgggaac tccccgaatg aacctgagtt cgaatttagg 660 

tggaatggcc acaatcccat ccattcaact tttgggactt gaaatgttgc ttcatttctt 72 0 

gttgggtcca gaagccttga gttttgctaa gcaaaataaa cttgtgctga gcttagagcc 78 0 

attggaacat ccgttaatca gcagcccttc ctttttttcc aaacatgcaa atacacttat 84 0 

cactgctgtt catgatagct ttgttgcagt tggaaaagat gcccccggta agagaatttg 900 

atttattat 909 



<210> 39 

<211> 1468 

<212> DNA 

<213> Homo sapiens 



<400> 39 

aggaagaaat cagaaaaatt cgagatctct caaatcagga agaacagtac aatcgattca 60 

tgaaattggt tggtggcaag aggagatcaa gaagtaaatc ttcagatcct gacctgaggc 12 0 

gatccttaga taagcaacct actgatagtg gaggaggcat ttatcagtat gataactatg 18 0 

aagaagttgc tatggataca gatagtgaaa ccagttctcc agctccttca ccagtgcaac 240 

cgccattttt ctctgaatgt tcattggggt atttttctcc agcaccatct ctttctttgc 300 

ctccaccacc tcaggtttct tctctgccac ctttgagcca gccttatgtg gaaggcttgt 360 

gtgtttctct tgaacctcta cctcctctac caccattacc acctctccca cctgaagatc 420 

cagaacagcc tccaaaacca ccttttgcag atgaggaaga ggaggaagaa atgctgcttc 48 0 

gagaagaact acttaaatct ctagcaaata aaagagcttt taagccagag cttccaaagc 540 

ataaatcagt ggttgtaaca ctaaatgatt cagatgatag tgaatctgat ggagaggctt 600 

ccaagtcaac aaatagtgtt tttggtggat tagagtccat gattaaagaa gcaagacgaa 660 

ctgctgagca agcttcaaaa ccgaaagtac ctccaaaatc tgaaaaagaa aatgatcctc 72 0 

tgcgaacacc ggaggctttg cctgaagaaa agaagattga atatagattg ttaaaaggaa 780 

gagattgcca atttaagttt attcttgggt gtaatgcata tatccttttt gataatttta 840 

gccgtgagaa acagcgtttg attaaatcag atcagctgaa gacaagttca tcatccccag 900 

caaactctga tgtggaaatt gatggtattg gcaggatagc aatggttact aagcaggtta 960 

cagatgcaga atcaaaactg aaaaaacata ggattctctt gatgaaagat gaatctgttt 102 0 

taaagaattt agtgcaacaa gaagctaaga agaaagaatc tgttagaaat gctgaagcaa 108 0 

agattacaaa acfctacagaa cagcttcaag caactgaaaa aattcttaat gttaacagaa 114 0 

tgtttttgaa gaagcttcag gaacaaattc acagagttca acagcgtgtt acaattaaga 1200 

aagctttgac tctaaaatat ggagaagagc ttgctcgggc aaaggcagtg gccagtaaag 126 0 

aaataggaaa acgtaaactg gaacaagatc gctttggggt gagggtttta tagaaatatt 1320 

ttacttttgg aaagcagaaa gtttttatga tagttttaag ggtaagatgt atatataaag 1380 
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taactctttg ctgtaaattc caagagatta agattaaagc tttgttttta aatgacatcc 1440 
acatttttgg atgtctactt tattaaaa 1468 



<210> 40 

<211> 602 

<212> DNA 

<213> Homo sapiens 



<400> 
gtcttctcct 
caggggtctg 
agacacatcg 
cacgaaggtg 
taccccaact 
gtgggagcgt 
gaggaagaca 
tgggtctgga 
ctttggaatc 
gaatatcatc 
tt 



40 

tcctgccacc 
tcaaggcaac 
cgtccgtcac 
tgttccaagg 
agcaagagtg 
ccctcgtgct 
cacgaaggct 
gccaagcgta 
gcctacaccc 
ttccacggaa 



ttgtgaagaa 
acacaaaatc 
aggacacact 
ccacatctcc 
gcccgttcat 
gccttcattt 
actgtgcccc 
tctctctgta 
tgctgcaccc 
tctctgtctg 



gataaaaata 
tccagttacc 
tccgatggat 
aacctcacag 
tggccgaggt 
ttcctaccct 
caccacaaga 
cccatcgcac 
acgatgtggc 
cacaaaaggg 



aaaatgaagt 
ttctggtgga 
ttacagcaac 
agaggacagg 
tcttgaggat 
cagctcaggg 
gagggaaaca 
acgaatttga 
tgtaactcca 
gcacatttaa 



caccagccct 
tgacgtttgg 
tcctgggggt 
aagtgatcag 
agagtgggag 
gtctgtgcta 
gggagacgcg 
atcatctgct 
ttaaagaatt 
ttttctgttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
602 



<210> 41 

<211> 789 

<212> DNA 

<213> Homo sapiens 



<400> 
tttttttttt 
gcggctacag 
ttgatcctgg 
tcggtggtca 
aacttctgac 
ggccgtgcgt 
ctggcatctg 
tggtccactt 
ccgtggtcgc 
ttacgcgtca 
tcggacgcgt 
aatgggatat 
tctcagcgag 
gagcacacg 



41 

ttagctgggg 
ggagaaggaa 
agtgaaggtc 
ggagcacccg 
aaggaggtga 
agactggtca 
ggtcctgtcc 
gtcctgggct 
ccgcgggaac 
cctcgggcgg 
ctccccaaga 
ttgggggcgg 
accactttac 



aaagggtgat 
gagacagatt 
caagatccct 
ataaagtcct 
tccccaggct 
gcttccgggg 
gtaggatcag 
gctgggtttc 
ttcaagggtc 
gacccgatcc 
aaggtacccg 
cgagtcggtc 
ttgtacgcaa 



caccttatat 
aaaccttttt 
ccagggatgg 
tcccaggtta 
ggtgtcggtg 
tgacttagag 
ccggtttgga 
aagccgactg 
cttcctaccc 
tccgctcctg 
ccagcctctc 
ccgcaaagag 
ggcggctgcc 



tgataataag 
gaatgttaat 
ggccctttcc 
gctgagtttt 
aattccttga 
cagggctgtg 
tggtctgggt 
tggagatcga 
gcgttgctga 
aggcccccac 
ctgaaacaga 
aaaccagcac 
tgcgtgggat 



acgtccttgg 
ttggaaggcg 
gaccgtcatc 
tcctctttcc 
cgaacgcttc 
gtcactttca 
cttgctggca 
aggcacagtt 
gagtctgggt 
aatgaagcag 
atgggatcca 
ttagaaaatt 
gcaatctcca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
789 



<210> 42 

<211> 806 

<212> DNA 

<213> Homo sapiens 



<400> 
cgccgcggaa 
tcccctatag 
ccagcggggt 
aaggtcgtac 
gcgtgcccgt 
aggcatatgg 
atgagctgaa 



42 

ttcccgggtg 
tagagttgct 
ggacccttgc 
gatcatattc 
gctccagcat 
ccagggactc 
cgacatggct 



gaccatctat 
ttaatggcca 
tcccgtcata 
acatcccgcc 
gtgaggctca 
cgcctgacac 
tgtgtgacat 



gaccttgttt 
tgtgcaggac 
gcctgtggga 
atctgcatta 
cgcgctgcgt 
tcacgaggca 
ccctgatgaa 



acgtgcagcc 
cgccgttctg 
cattctgctg 
ctctgatccg 
gcctcccttg 
gccttctgtt 
gagctatatt 



ctaaaaagaa 
aatgagccca 
aagtaccgag 
ctgagtgacc 
tgcctgaagg 
ctgtaggccc 
ccacacgcat 



60 
120 
180 
240 
300 
360 
420 
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tgctcaaaga cagcagtgga agtgatctga cctacaccat tccaaaggac acagacaagg 480 

cctgcttgaa agggctcttc caggccctgg atgagaacct gcatcagctg cacctgacgg 540 

gctatgggat ctcagacacc accttagaag aggcagaagg gaggaccgca gcacctgagc 600 

ctcccatgct ggaggacggc cacgcggtca ctcagcgctt cagcttcatc caggtggtgg 660 

gttgtgaaga tgatcgtacg acctgggtcc aagcacaggg tgcatcagca ccaggaggac 720 

aaaggcccca ggaggacctt ccctcctttc ctcaggatgg tagatccagg gcacaattta 780 

aagatcctca ccaattttca aactga 806 



<210> 43 

<211> 1597 

<212> DNA. 

<213> Homo sapiens 



<400> 43 

atggcttctc atgcttatga taaaaatcaa aatgccaatg tcttagtcca cttgtgcttc 60 

tataacagaa tacccaagac tggagcatat tacctagatt ctaggagtgt gagcatcagt 12 0 

tatttgattg gccatcatat agacatgggt cttgagacag ccacttccaa aaatgaattc 180 

attttcgata gtgcaagtac actgctgggg atgcttttta ggaaaccctc ccaacactct 240 

ctctcccttt ttagcaagaa attccaggag aatctgatat acctggaatc tgatgactgc 300 

ttgccacctc ctccccctcc accttggtct gagccaccgt catttctcac ctggactatc 360 

gtcacagttt tccagtgggt ctccctgctt ctgtccttgc ccaacataca ggtcattctt 420 

tacagagcag ttggagtagt accttcacaa cctaagtcag acaacctgaa aggatgggga 48 0 

tcggggagag tcgttaaaga gaagctcaga tcagagatac ctgactggaa gattaaaagt 540 

attcacatac tggagaggac tgcttcctct agcaccgagc cctctgtaag tcggcaattg 600 

ctagaaccgg agccagtccc cctctccaag gaagctgaca gctgggaaat tatagaaggg 660 

ctgaaaatag gccaaaccaa tgtccagaaa ccagacaaac atgagggctt tatgctgaag 720 

aaaagaaaat ggcctttaaa aggctggcac aagattcaga aaggaaaggt ccatgggagc 780 

atagatgtgg gactctcagt catgtcaatt aaaaagaaag ctcgaagaat agaccttgac 840 

accgaagagc acatctatca tttgaaggtg aaatcagttt tcaacagttt ctctgccatt 900 

atcaggggaa acgatttgcc tacaccagtg gtgaaatccc aggactggtt tgatgcatgg 960 

gtctccaaac tgcgacatca tcggttgtat cgtcagaatg aaattgtgag atcaccaaga 1020 

gatgctagtt ttcacatatt tccttcaacg tccacagctg aatcctcacc agctgctaat 1080 

gtttctgtaa tggatggaaa gatgcaacca aacagctttc cgtggcagtc ccctttacca 1140 

tgcagcaata gcctccctgc aacgtgcaca actggccaga gtaaagtggc agcctggtta 1200 

caggactcgg aagagatgga caggtgtgca gaagatcttg cacattgcca gtcaaacctt 1260 

gtggaactta gcaaactcct gcaaaatttg gaaatacttc agagaactca gtcggcacct 1320 

aactttactg acatgcaggc taactgtgta gatatttcaa agaaagacaa gcgggtcaca 1380 

agacgatgga gaacaaaaag tgtcagcaaa gatacaaaaa tacaactgca ggttcctttc 1440 

agtgctacca tgtcaccagt tcgcttgcat tcctccaacc ccaacctttg tgcagatatt 1500 

gaatttcaga ctccccctag ccacctcact gaccctctgg aaagttcaac agattataca 1560 

aagctgcaag aagaattttg tctaatcgca cagaaag 1597 



<210> 44 

<211> 1827 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__feature 
<222> (1) . . . (1827) 
<223> n ~ a,t,c or g 



<400> 44 

agctgttcag gtgttcgtgc tgaccaggta ccaggtcaaa gcttccagag aggaccgtca 60 

ggggccagtc tttttaagat tgtggctgat aaaactccat accttactat ggaagaaatc 120 

acaaggacca ttcatattgg accaagtaga ctagggcatc cttgcttcta tcatcagaag 180 

gatataaaac tagagaacct catcataaag cagggtgagc aaatcatgct caactcagtt 240 

gaagagattg atggagaaat aatggtgagc tgtgcagtag caaggaatca tcaaactcac 300 
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tcatttaatt tgcctttgtc acaagaagga gaattctacg agtgtgaaga tgaacgtatt 360 

tacactctaa aggagattgt tgaatggaag attcctaaga acagaacaag aactgtaaac 420 

cttacagatt tttcaaataa gtgggactca acgaatccat ttcctaaaga cttttatggt 480 

accctgattc tcaagcctgt ttatgaaatt caaggtgtga tgaaatttcg aaaagatata 540 

atccgcatcc tccccagtct agaatgtcaa gtcaaagaca tcactgattc ttacgatgct 600 

aactggtttc ttcagctgtt atcaacagaa gatctttttg aaatgactag taaagagttc 660 

ccgcataggg actgaagtca tagaaccacc tgaaggaaac cacctgcccc aaagcatttt 72 0 

acagcctggg aaaaccattg tgatccacaa aaagcaccag gcatcaagaa tcttaacttc 780 

agaaattaga agcaattttc ctaaaagaca cttcttgatc cccactagct ataaaggcaa 840 

gttcaagcgg cgaccgaggg agttcccaac ggcctatgac ctagagatcg ctaagagtga 900 

aaaggagcct cttcacgtgg tggccaccaa agcgtttcat tcccctcatg acaagctgtc 960 

atccgtatct gttggggacc agtttctggt gcatcagtca gagacgactg aagtcctctg 1020 

tgagggaata aaaaaagtgg tgaatgttct ggcctgtgaa aaaatcctca aaaagtccta 1080 

tgaggctgcg ctgctccctt tgtacatgga aggaggtttt gtagaggtga ttcatgataa 1140 

gaaacagtac ccgatttctg agctctgtaa acagttccgt ttgcccttca atgtgaaggt 1200 

gtctgtcagg gatctttcca ttgaagagga cgtgttggct gccacaccag gactgcagtt 1260 

99 a ggaggac attacagact cttacctact cataagtgac tttgccaacc ccacggagtg 132 0 

ctgggaaatt cctgtgggcc gcttgaatat gactgttcag ttagttagta atttctctag 1380 

ggatgcagaa ccatttctag tcaggactct ggtagaagag atcactgaag agcaatatta 144 0 

catgatgcgg agatatgaaa gctcagcctc acatccccca cctcgccctc cgaaacaccc 1500 

ctcagtagag gaaacaaagt taaccctgct aaccttagca gaagaaagga cggtagacct 1560 

gcccaagtct cccaagcgtc atcacgtaga cataaccaag aaacttcacc caaatcaagc 1620 

tggcctggnn nnnnnnnnnn nnnnnnnimn nnnnnnngat ttggtggatg aagagaaaga 1680 

taggagcaac cgtggggcca cagcattagc agaaacattc aacaatgata aacatcataa 174 0 

atatcaagat gtgacagaag ccacttaggc agcatacata aatgttgcag tgaaaaaaga 1800 

tgctagcctt ctagctgaca aacgagg 182 7 



<210> 45 

<211> 2435 

<212> DNA 

<213> Homo sapiens 



<400> 45 
atgtttgcgt ttgaacctct 
gggagccggc ttttccgcac 
cgggaagccg cgcagtggtt 
agcagcgacg ccagcatcgg 
cccgacttcc ccggcagccc 
aaggaccggc ccagcctgac 
ctaaccgtga ccccaagacg 
ccctgtggcc cgctccgact 
gacctcagcg tgtgcggcca 
ttcagctctc tggcctcgcc 
atcggcacct ccgcctgtct 
gaagtctccc tggaccgagc 
aaggacacca ggatggtcca 
atgaactcag gaacccctga 
tcctgctgta aaaggaaact 
acaggcaaga ggagggccac 
gtccggagag agcatcagga 
gacaggctgg agagaactag 
tctctcctcc attcccaccg 
acccaggacc tgactccttt 
agtttccaca agaagaaaat 
gccacttctc tctctggatc 
acaagtggtg ctccgtcctc 
actctaagta tttcaaacaa 
cagaagggtc ctgtcccctt 
aagattgggg aaggggtgtt 
agccataaaa atcattgcta 
ctttgaggaa atcctgccag 



tggcgggtgc cggccatggc ggctctcgct cccgggacct 60 

atatggggct gcggacggca ggagacagcg gcggccgggc 12 0 

cccgccgcag gaccggaggc gtttcttcaa cagcagcggc 180 

cgacccctcg cagtccgacg atcctgacga tcccgacgac 240 

ggtgaggcgg cggcgaaggt gtcccggcgg ccgagtgccc 300 

cgtgacccca aagcgctgga agctgcgagc tcgcccaagc 360 

cctggggctg cgagctcggc ccccgcagaa gtgcagcaca 420 

tccgcccttc cccagccgcg actccggccg cctcagcccg 480 

gcccagggac ggcgacgagc tgggcatcag tgcctccctg 540 

ctgccccggg tccccaacgc caagggacag tgtcatctcg 600 

ggttgcagcc tcagccgtcc cgagcggcct ccacctccca 660 

atctctcccc tgctcccagg aggaagcgac aggaggagcc 720 

ccaaacccgc gccagcctca ggtcagttct ctttggcctt 780 

ggattctgag tttcgggcag atgggaagaa tatgagagag 840 

ggtggtggga aatggaccag agggtccagg tctgtcaagc 900 

aggccaggac tcttgtcaag agagagggct tcaagaggcc 960 

ggccagtgtt cccaagggcc gcattgtgcc aaggggaata 1020 

atcaagccgg aagagcaaac atcaggaggc aacggaaacc 1080 

ctttaaaaag ggccaaaagc tgggaaaaga ttcgttcccc 1140 

acagaatgcc tgcttttgga ccaaaaccag ggcttccttc 1200 

tgtgactgat gtgtcagagg tctgcagcat ctataccact 1260 

cctcctatca gaatgttcaa accggcctgt catgaacaga 1320 

ttggcactcc tcctctatgt atttgctaag ccccttaaac 1380 

aaaggcatct gatgctgaaa aggtttatgg ggaatgcagt 1440 

tagccattgc cttcccacag aaaaactgca acgctgtgag 1500 

tggcggaagt gtttcaaaca attgctgatc acacacccgt 1550 

ttgaaggacc agatttagtc aatggatccc atcagaaaac 1620 

agatcatcat ctccaaagag ttgagcctct tatccggtga 1680 
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agtgtgcaac cgcacagaag gctttatcgg gctgaactca gtgcactgfcg tccagggatc 1740 

ttaccctccc ttgctcctca aagcctggga tcactataat tcaaccaaag gctctgcaaa 1800 

tgaccggcct gattttttta aagacgacca gctcttcatt gtgctggaat ttgagtttgg I860 

agggattgac ttagagcaaa tgcgaaccaa gttgtcttcc ttggctactg caaagagcat 1920 

tctacaccag ctcacagcct ccctcgcagt ggcagaggca tcactgcgct ttgagcaccg 1980 

agacttacac tgggggaacg tgctcttaaa gaaaaccagc ctcaaaaaaa ctccactaca 2040 

ccctcaatgg gaagagcagc actatcccca gctgtgggtt gcaagtgagc atcattgact 2100 

acaccctgtc gcgcttggaa cgggatggga ttgtggtttt ctgtgacgtt tccatggatg 2160 

aggacctgtt taccggtgac ggtgactacc agtttgacat ctacaggctc atgaagaagg 2220 

agaataacaa ccgctggggt gaatatcacc cttatagtaa tgtgctctgg ttacattacc 2280 

tgacagacaa gatgctgaaa caaatgacct tcaagactaa atgtaacact cctgccatga 2340 

agcaaattaa gagaaaaatc caggagttcc acaggacaat gctgaacttc agctctgcca 24 00 

ctgacttgct ctgccagcac agtctgttta agtaa 2435 



<210> 46 

<211> 786 

<212> DNA 

<213> Homo sapiens 

<400> 45 

ctcggctgac tccaaggaag tccagggcaa cgtgattgtc catggccact tgtgttggtg 60 

cattccgaac aaacagtctg ctgcctggct gcacaggtct gaccagtgtg gtcatgggtg 120 

ggtgacgatg taacaaggtg cattctggct gctgttttcc agccccagat ccttcttcct 180 

cctctcggtc ccaggtggct tccagagtca tcccgctggc tcctcctgca tggcaagtcc 240 

cagttagctg tacagaatct gcagaaggtg gctgcaatga acgggaggaa ggaggaaggg 300 

gaaaggctga ccaaggaggt gatgagctcc tacatccaaa gcgagtttgc aagtgtctgc 360 

acctccaact caatcttgga cctcttccga accccggcca tccgcaaggt cacatgctgt 42 0 

ctcatggtga tttggagaat ggccccacct gcaggcagag agctccggat agctgccgag 48 0 

tcactttctc agcaaaaacg ggcttttgct gtttccagaa ggatccaaga gaggactttc 54 0 

agctctggaa ttttgaactc aggatctgtt tctccctcgc ggaaagagga aggtggacga 600 

agggcaagcc caggccggca ggggctaccg caggaagacg cccattcctg gactcgggtt 660 

cgcaggagcc cccttgctcc ccagagccgg aactgcgctg cctgccacgc tcggctgact 72 0 

ccaaggaagt ccagggcaac gtgattgtcc atggccactt gtgttggtgc attccgaaca 780 

aacagt 78 6 



<210> 47 

<211> 749 

<212> DNA 

<213> Homo sapiens 

<400> 47 

ggggagaatc gatgacatta ggccactgga ggccattagc gagtgagctg agctgcgcgg 60 

cgcggcgcgg cgcggcacgg cacggtggga gtgtctccgg ctggcctgca gggagaacac 120 

cgactgagac ctctaaccct ggctccagtg gcatgtgata accagtcgtt ctgctcagtg 180 

gtgggggtgg tatttgccga gccagaatgt gtgtgctgtt ttaggaatcc gtcacctttg 240 

aggatgtggc cgtggagttc atccaggagt gggcattgct ggacagcgca cggaggagcc 300 

tgtgcaaata caggatgctt gaccagtgca ggaccctggc ctccagggga actccaccat 360 

gcaaacccag ttgtgtctcc cagctggggc aaagagcaga gccaaaggca acagaacgag 420 

ggattctccg tgccacatgt gttgcctggg aatctcaact taaacccgaa gagttgcctt 480 

ctatgcagga tcttttggaa gaagcatcct ccagggacat gcaaatgggg ccggggctgt 540 

tcctgaggat gcagctggtg ccctccatag aagagaggga gacaccattg actcgagagg 600 

accggccagc tctccaggag ccgccttggt ctctgggatg cacgggactg aaggccgcta 660 

tgcagattca gagggtggtg ataccagtgc ctactctggg ccaccgcaac ccatgggtgg 720 

ccagggattc tggtgccatt ggaaatggg ~ 749 



<210> 48 
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<211> 1622 

<212> DNA 

<213> Homo sapiens 



<400> 48 

tttcgtaaag aacaagacag aactatctct gtttctggct ccactgcctg ccagtgaagg 60 

agttttcatt cagactttcc gaagagaggt ggagaaatct aaagactgag gagaagagat 120 

cctttgagcc agatggggca ttagttcttc tgcttttctc agcatggata aaccatttcc 180 

tcaaggattt tctcatgtgc cctgaaatcc atgtaactac aagggctcct ctttatcacc 240 

ataagtgcca ccctgactta aaaccactca gagctaaaaa atcaaggcaa aatggatgct 3 00 

gcggtgacag atgattttca acaaattctg cctattgaac agctgcgctc tactcatgct 360 

agcaatgact acgtggaacg gcctccagcc ccctgtaaac aggccctctc cagcccttcc 420 

cttattgtgc aaacccacaa gtctgattgg tctctggcta ccatgcctac ttctcttccc 480 

cgcagtctca gccagtgcca tcaactgcag cccttgcctc agcatctgag ccaatctagc 540 

attgccagct caatgtccca tagcaccact gcctctgatc aaaggctctt ggccagcatt 600 

acaccctcac ottcaggcca atccatcatc cgaacccaac ctggagcagg ggtccaccca 660 

aaggctgatg gtgctctgaa gggagaagct gagcaatctg cagggcaccc tagtgagcac 720 

ctcttcatct gtgaggaatg tgggcgctgc aagtgcgtcc cctgcacagc agctcgccct 780 

ctcccctcct gctggctgtg caaccagcgc tgcctttgct ctgctgagag cctcctcgat. 840 

tatggcactt gtctctgctg tgtcaagggc ctcttctacc actgctccac tgatgatgaa 900 

gacaactgtg ctgatgagcc ctgctcttgt gggcctagtt cttgctttgt ccgctgggca 960 

gccatgagcc tcatctccct cttcctaccc tgcctgtgct gctacctgcc tacccgtgga 102 0 

tgcctccatc tgtgccaaca gggctatgat agcctccggc gaccaggctg ccgctgcaag 108 0 

aggcacacca acactgtgtg cagaaagatc tcttctggta gtgcaccctt ccccaaggcc 1140 

caggaaaagt ctgtatgacc ttccaacaag gtggatccag agcttttctc cttctagtcc 1200 

ccaacagcaa agcataggcc tcatctttgg agagggggag gagtgataaa ctagccaaag 1260 

ttagggcctc tcttttgttc ctgcagtgtc aggggaatga ccaagtacat cctggtgcag 132 0 

gatgccttgt tctttctcac agtatctatc ccactcctct tcagtcttta caccctgcca 1380 

gctcagccct ttatggttgt catgcaaaat tcaggtgata tatgggtatg aggtttgaac 1440 

actgaggact gacagggcca gcaacgtgga ggtttagggg ctccccaatg taatacctct 1500 

cgatgcaggc tctgatcgtc actctgtttt ctgctgtgcc tttggaagct ttcttctaag 1560 

atggttttca caggtacatg tggaacagcg ttcaaccttc cagggaatac gacccctgaa 1620 

tt 1622 



<210> 49 

<211> 1219 

<212> DNA 

<213> Homo sapiens 



<400> 49 

gcgttaccgc gccgcgaatg cccgggtcga cgatttcgtg ccacccgaac agggagtaaa 60 

tagtgctttg tatctttaag gaagcccttt tttaaattta aaaaaaaagt cttaattata 12 0 

agactcttta aatgcccaac tgcatacttg gagctttggg actgaatttg gaactttcct 18 0 

gtcaagcgac ctcccacgac tttactgctg agcctgtgca cgtgtgtgta aggggagaaa 240 

tccaggcatc tagatgcaga cttgtaccca gttacttggg gtcgcgtgcg ctcacctggg 300 

acctgggctc gtgcgcttag tccggagccc tgatctgcga acaggatatt aaaactttta 360 

gtacaattga ttggactact tgaaccatcg ggatttgggg aggaactcca gatttttcat 42 0 

ttttaaactc taaatgtatg aggaatttac agaatggaga acgaaaagga aaatctcttt 480 

tgtgagccac ataaaagggg actaatgaaa acacctctga aagaatccac cacagcaaat 540 

atcgtgttgg cagagatcca gcctgacttt ggccctttaa ccacacctac caagcccaag 600 

gaaggctctc agggagagcc gtggacaccg acagccaacc tgaaaatgct catcagtgct 660 

gtgagccctg agatccgcaa cagagatcag aaaaggggtt tgtttgacaa cagaagtgga 720 

ttacctgagg ccaaagactg tatacacgaa cacttatctg gagatgaatt tgagaaatcc 78 0 

caaccaagtc gaaaagagaa aagtttagga ttattgtgtc ataagttctt agcacgatat 840 

cctaattatc ccaaccctgc tgtgaataat gacatctgcc ttgacgaagt ggcagaggaa 900 

cttaatgttg aacgtcgacg catttacgat atcgtgaacg tcctagagag tttacatatg 960 

gtgagccgcc tcgccaaaaa caggtacact tggcacgggc gacacaatct caacaaaacc 1020 

cttggcacct tgaagagcat cggggaggag aataagtacg ccgagcagat tatgatgatc 1080 

aaaaagaaag aatatgagca agagtttgac tttattaaga gttacagtat agaggatcat 1140 

atcatcaaat caaacactgg cccaaatgga cacccagaca tgtgttttgt ggaactccct 1200 
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ggagtggaat ttcgggcag 1219 



<210> 50 

<211> 1241 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (1)...{1241) 
<223> n = a,t,c or g 



<400> 50 

gcaccgtccc gcaattcccg ggtcgacgat ttcgtaggaa agaatagggc acaggtgggg 60 

atggatggtt ggttggatat gtcacttttc ttatggtaaa caattaaatc ccatattctc 120 

tctgccttaa aagaagaaat taatgttttg tagtcctagg taattgatgt tttgcccatg 180 

atttcccaaa cttgtgtcag tcccacgtta cacgcaaact aaattttagg tttgaaattt 240 

gtccctagtt aattggtctg cttgacaatt ttgtgagtct tattaacccc aatcaataga 300 

gttgagagac tatggcttta aaaaattaat gcaaacctgg gctttagctg taataacacc 360 

cacctaggaa taaattaata ttaccataag gaaaatgtga tactttctga tcttgttttt 420 

aaagttgaaa tgcaacaaac tttttcttgc tgtatataaa tattctggca tagtattaat 480 

aaggcatagc ttttcaggaa attgtccaca aaagggtttt attctctttg gcttgtggac 540 

tatttttcat tgaagcatgc gcttacctat gctgattctt actaaaagca taggctgggg 600 

tatttattgg cgaaaggaaa tgtgtagtgt gggctggact gttggtggag gctggctttt 660 

tagcccactt gctatacatg ctgccaatgg atttaagacc tgaaaatggt gaaagttgag 720 

tgggattatt tccctcctga cacacttaat ttacagtact cctctctacc ccttaggttg 780 

ggctctgcct cagaggagtg agtttttttt ttttttctat aaagtttaca ttgtcttact 840 

atttattgag ttgaattttt ggtcatggcc tatgcaaata taagaaatcg ggctttaaat 900 

attagtcagt tttcatggct atgactagat tgttttcttg tataactaaa tacctgtata 960 

aaatgaacta atgttttttc tcccctccct aaccccttcc ttatgaacaa tgctttaggt 1020 

gtattgaaat ctttaagacc agtagagctg aagttagaac tcattatgtt ccaccgcgaa 1080 

agcttatggc catgcagcgg ccgggtcccc atcacagcct tggaaatggt cgaggctatt 1140 

ngagctttgg cagaggaaat gatattaaaa agatgagggt tgncgcggcc gcgaattcca 12 00 

gatctgtgtc ctcgtagatn ctaaaaacgc cagtattttg t 1241 



<210> 51 

<211> 811 

<212> DNA 

<213> Homo sapiens 



<400> 51 

ggagcctggg gccagagggc cagacagcca cagagctcct ggcgtgggca aggctggcca 60 

aggatggcga cgcccagggg cctgtttggc cctgctcctg ctcctcctgc tcccgacctc 120 

aggtcaggaa aagcccaccg aagggccaag aaacacctgc ctggggagca acaacatgta 180 

cgacatcttc aacttgaatg acaaggcttt gtgcttcacc aagtgcaggc agtcgggcag 240 

cgactcctgc aatgtggaaa acttgcagag atactggcta aactacgagg cccatctgat 300 

gaaggaaggt ttgacgcaga aggtgaacac gcctttcctg aaggctttgg tccagaacct 360 

cagcaccaac actgcagaag acttctatct ctctctggag ccctctcagg ttccgaggca 420 

cgtgatgaag gacgaggaca agccccctga cagagtgcga cttcccaaga gcctttttcg 480 

atccctgcca ggcaacaggt ctgtggtccg cttggccgtc accattctgg acattggtcc 540 

agggactctc ttcaagggcc cccggctcgg cctgggagat ggcagcggcg tgttgaacaa 600 

tcgcctggtg ggtttgagtg tgggacaaat gcatgtcacc aagctggctg agcctctgga 660 

gatcgtcttc tctcaccagc gaccgccccc taagcccaca atgcattccg agatcactag 720 

ctctatattc aaccgcatca gcatgacgtg tactacgtac tactactctt ccaccaggta 780 

taagtacaac aacattattc ataaagattg a 811 
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<210> 52 
<211> 688 
<212> DNA 

<213> Homo sapiens 
<400> 52 

ccttcaaagc ccaaactcag cctcctctgg ggtgacttcc tgactggtca ctgccatggg 60 

acccacctcc tctatccagt cattcagcta atacccaagg agcacctact gcataaggac 120 

ctgggttacc tgcagcagtg gctgaaggcc tttgtaggtg ccttcaagaa gagcatctca 180 

ctgtcctctc tggagccacg aaggccagag gaggcaggtg cagaggtccc gctgctacca 240 

ctggatgagc tgcatgtgct ggccgaacat gctgaaccat ggtgaactgg agcaagccct 300 

cctgctgctc aagctcttga tcatgctctg cacgaacctg gagagcatac aggcgggacg 360 

gcgccaagtg ctagagcacc gagtgctgtc gctgtggacc agatacctgg cggagctgaa 420 

aggatgccca ccaccccagg gccgaggcac gcagttggag aatgtggccc tacatgctct 4 80 

gcttctctgc gagggcctct ttgaccctta ccaaacctgg cggcgccagc gcagtgggga 540 

agt cat cage tccaaggaga agagcaaata caagttccct cctgctgctt tgccccagga 600 

attcagcgcc ttcttccaag gtcaggcccc gcccctgccc ccacttggct ccacccccaa 660 

acctaggccc cttcctgtcg ttccttga 688 



<210> 53 

<211> 323 

<212> DNA 

<213> Homo sapiens 

<400> 53 

gtgegattea tctttctgtc tctaggtctt agcatgtgca eggcacagag caggcgctca 60 

gagtgectte agtgaaggaa taaatgacca taccaccagg cacagagtga ggectttaga 120 

aaccctgagc ateategggt gcgcccacca cttctgtatg tgactcaaag taagtgtaag 180 

gaaatccagt tctgttttgt ttctgaaata ctagatgtca ggaagggtgt ttaggtgtca 240 

ggctttggtg gcctacactg tgctttcgga attattcact gaggegaaag aacagagatt 300 

agctactgat gaaggacaaa aag 323 



<210> 54 

<211> 339 

<212> DNA 

<213> Homo sapiens 

<400> 54 

gtgegattea tctttctgtc tctagttctt agcatgtgca eggcacagag caggcgctca 60 

gagtgectte agtgaaggaa taaatgacca taccaccagg cacagagtga ggectttaga 120 

aaccctgagc ateategggt gcgcccacca cttctgtatg tgactcaaag taagtgtaag 180 

gaaatccagt tctgttttgt ttctgaaata ctagatgtca ggaagggtgt ttaggtgtca 240 

ggctttggtg gcctacactg tgctttcgga attattcact gaggegaaag aacagagatt 300 

agctactgat gaaggacaaa aagaattttc tgetgaate 339 



<210> 55 
<211> 1187 
<212> DNA 

<213> Homo sapiens 
<400> 55 

ggaggcaggg ggccgggcag taacacttgc ttgcccactg ctcacgtctg atgggtgccc 60 
ccattcgtca cgaggctatg gaccagtacc catctgcagc ccagggttgg ggacccccac 120 
catacatget ctgaaaataa gtgtctttaa gttgtgtttt tctgtgtgac atcatctctg 180 
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aacttgaaca tgttcattac ggcaagattg taatcctgct gtattcctcg ccctccgtat 240 

tctatccctt taaaaactag atatttaaac attttatcct aatgtgtttc atcagaacac 300 

tttgcacact ttcttactaa aacggtatgt gggtggtccg agacagaatg gtacctacag 3 60 

tgcaggctat gtttctgtgg gaatggaaaa gaggacttta gaaagtgaac ctgaagaact 420 

cttatttcta agggaatcca gtcaagcttg agaaaatgaa atattcttgt gtgtagtgat 480 

tcagtctcag gatattgaag tcctttttaa acgtccagtg ttctcctgta agaatacaat 540 

ataaattaat ggcaagcttg cgccaggtgc ctgcccctcc taagcacttt acacgcacag 600 

ctcattcaac ctcacgtgtc gggagcatca gggagctcac cagtctcttg aggatcacag 660 

gccaattgct tgtccttccc gacccaagcc ccaccccaca gtgacttgct ctcttctgat 720 

tttacaggtt ttttggcagt aatacacatg ggagtgaaat tgaaaacctc cctattattt 780 

tccctctttt cctggaaagc tgttttttgc cctcacaatc attcattact tctctaacta 840 

gggggaagta ctacaagctg tcctcaggta ctgcccctac gtgtgtttcc ctgggatggg 900 

gccttgctag aggtgattct gctgccccag cattgggaag caggaccagt gcatgtgccc 960 

catgcagcca cggtacatgg aagctgtcac ttgagccatc agaccggctc agcccatgtg 1020 

accggagctc tgaagaggca cacacgcatg ccccgcacag gctgttagca cttgtcgcct 1080 

cgctgccctg gtccagacta cccttgcttg cgtcagtcaa aactgttgta actactttga 1140 

gtcttttctg catgattgct ctttgagaga actttacaag ggtgttt 1187 



<210> 56 

<211> 327 

<212> DNA 

<213> Homo sapiens 



<400> 56 

ccctgccggc ttcgtgacct ggaggagtca cagaactacc ctgagcctca ggtgagaaaa 6 0 

ctggggctca ggaaggcagt gagaaactgg aagtcagcca cgtgtcagga caggagcctg 120 

gatttagccc acatctttgg ctttggtttt gcacacagct gctgctggga gctggggcca 180 

gggcaggcag aaataatcct gggctctgca gagcctgttt caggctcatg gtctacccta 240 

ccccgccccc atttccggca aaaccaggct gagggctggg ctgctagcga agctgacagc 300 

aagagtgaca gccctgtcct gtcgctg 32 7 



<210> 57 

<211> 1471 

<212> DMA 

<213> Homo sapiens 



<400> 57 
gaaaggccac gtggttgact 
tagaatttga agagaaatcc 
cttccctttc aggtgctcct 
tcagcccctg aggccaagca 
gcccacatgt cccagaaggc 
ctcaggaagc atttcccagt 
gaaatgccaa cagcctccca 
attgtgtctt ctgaccttgc 
agggtcaagg tgtcccatgc 
gttagcatat gcttggaagc 
caggttccag atgatattta 
ccagacaggt cagctgaagt 
gatgggagaa gggtgactcc 
agcgttgcca gccacacacc 
gacaactcag atgacaaaga 
ccttcatctc cacggactca 
ccaggaagag cctcagactg 
aaaactaaca gtgcatcgaa 
accgagtcag acacactgga 
ctttcaagga gtgagaccaa 
cacctggcac ggggtgaact 



tattcctaga ccctccactc aaagtactca gctaggaagt 60 

aggtccacag aaactgcctt ctcattggcc actgttgtct 120 

cgcctgcttg cagatatcag accccagctc atctcagacc 180 

aacatacatc atcatggagg ctgcagacgt agctcaccgt 240 

agggggcttt cgcaacattg caatccaaac ttcccccagt 300 

tttcaaaagg aagagactca cagccagtaa gtccctggtg 360 

aagtgccatc caggtcaacg gtaacctctc tgaacaggac 420 

ctacttaagg ttggctcagc atcttgagga tgggcctcga 480 

attcctccca agggtcccca aggtgcaaag caatggtcct 54 0 

aggaacttgg aggtccttag agaaagccac agctgccatt 600 

tcacagtcct tcctgggaag ctagagagtc tgctctcagc 660 

aagtaactcc atacaccctt tggatgacac acgtccaggt 720 

actagattca gaaaaatcca cctcctgctt aaatgccacc 780 

aggcacagag gaacttaaac ctgaattgct tttgcccaaa 840 

ccttggctca ctgtcatctc agtcaaagga aacgtgtgtt 900 

cagttccccc tcacaaggct cccacagcca gccagcccac 960 

tccttcatca agtaacaatc accagaatct ggtgtcactc 1020 

atctgcccct gggtgtcagg agcagacggc aaacaacccc 1080 

gtttccaaat tgtccaggaa gtaatcatct cccatcctct 1140 

acttcagagc aacagggaga ttagtgacat taatcaaatt 1200 

ctgcgacctc caaggccgac tgcaatctgt ggaggaatct 1260 
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ctgcactcga accaagagaa aattaaagtc cttttgaatg taattcaaga cttggagaaa 1320 

gctcgagctc tcactgaagg cctccttggc tctcctctta ccattgaaca cttggataca 1380 

tcctatctca ccaagagcac agacccaaca ccaatgccta gagattctat ccatggctct 1440 

caggagcttg cagggatttc cgtacataaa g 1471 



<210> 58 

<211> 811 

<212> DNA 

<213> Homo sapiens 



<400> 58 

atggacagtc aagaagtaga gaaatacccc aacacttctg tagcttgtga ggaaatcccc 60 

ttttcaggaa tccatgttgc tggtgggaaa tcaggagcat tagaacatgg aaaggatgac 12 0 

cttgatgaac ccattgaaaa tcccctcttt tgtttctcaa gctttagcaa tgcacttgca 180 

atattgcttc caaaagtgtt ccttaagaat attcatatac tgcaattcat ctacaggagt 240 

tttcaccttt tgacaatggc aaaagcaaag tttgaaggtg ctgagtctgt ggagccagtg 300 

tcaccttcac agcccaaaag gccatcctat gtccccctag aagagctatg gacgaggtta 360 

acaaaaggga acagcaggcc tcagcagagg gacagggaga agggaggatg gatgaaggga 420 

gtgcagcaag gccatcaagg agtaggaaag caggaggaag gttcagagaa catcaaagaa 480 

aaagctggta actggggaga tgggcggagg tggggaagaa atcagcaggt tttgattaat 540 

aagattactt ttaagcaaac tagcttgaat tatacatcca acttagtctt tcctgaagaa 600 

agctatttta gcaatgcgta ttccttgagt tggactcata ttgtcctaca tcattgtgaa 660 

attcatttac ggaaggttaa gattggatgc cttcttaggg aaacgaacct aaaagtccgc 72 0 

catgcactat cagttacttc agaacttgga gcagatatca gcagacttgc agggtttgat 780 

gaggcatata aacttacaag tattggctct g 811 



<210> 59 . 
<211> 593 
<212> DNA 

<213> Homo sapiens 



<400> 59 

attcccgggt cgacgatttc gtctccttcc cccttccgga agctatagac acttccgaga 60 

gagcgcgaac ttgcggagca ctttgcatgc gcgccgcggg taggactgga cggtcggatt 120 

tgtaggcaat tctgacaccg agctcccggg ctcggttggc agaagatcgc tttgggaatc 180 

ctcctactct acgaggacca gaaaccaagg acgccaagcc atacaaattc agcacagctg 240 

actcagagag gtggaaagga aatcaggaca aaaagccaca atgtcttctg gaggaggata 300 

ttgccagcca gaataactgt gagagatgta ggcaagcata gtcctggaaa caaacagcac 360 

aattccttta aagccctaaa ggatgaagat cttcctgtag agaagtattt aatggaaagg 420 

cagcctgtag gtgagccagc tgccgaccag gtagctatgg atgtgatgca cagccccatc 480 

ctccaactgg aaaggaaaca caaagtctca agtgacagca accagacaga gactactgca 540 

gagaagttgc cagaggactc tttcctaaaa aacaagcaaa aacaggttta tat 593 



<210> 60 

<211> 538 

<212> DNA 

<213> Homo sapiens 



<400> 60 

atgaccattg tcagaaatgg tgtcagagac aaggatatca ctgcagacat caaacggata 60 

ataagagaat actatgaaca attccatata cataagttta caaagattgg aaatgcttat 120 

gctgttttaa gtaatccaga aaagcgaaaa cagtatgacc tcacgggcaa tgaagaacaa 180 

gcatgtaacc accaaaacaa tggcagattt aatttccata gaggttttgt gaagttgata 240 

taactccaga agacttgttt aatatatttt ttgggggtgg atttccttca ggtagtatac 300 

attctttttc aaatggaaga gctggttata gccaacaaca tcagcattga catagtggac 360 



37 



WO 01/74836 



PC77US01/10472 



atgaaagaga agaggaaaga ggagatggag gtttttctgt gtttatccag ctgatgccca 420 
taattgtatt gatcctcgtg tcattattaa gccagttgat ggtctgtaat cctccttatt 480 
ccttctatcc cagatctgga acagggcaaa ctattaaaac gcagacagaa aacttggg 538 



<210> 61 

<211> 816 

<212> DNA 

<213> Homo sapiens 



<400> 61 

atgcctaaga aagatgatag caactggggc tatgcgttca cagatgggga gcaactagga 60 

gggccaagtg caaaaggtaa gatcaagtgg cagcccacca tttgcacttg gcccactttt 120 

gcaaaggaga gtgggcgctc cctttctctt cctggaagat cagtcccacc ccccatttct 180 

acatctcctt gggtatacca gcctacttat agttactcta gtaaaccaac cgatggacta 240 

gagaaagcaa acaagagacc aactccttgg gaagcagcag caaagtctcc tctcggtcta 3 00 

gtggatgatg ctttccaacc cagaaacatc caggaatcca ttgtggcaaa tgtggtttca 360 

gcagctcgga ggaaggtgct tccagggcct ccagaggatt ggaatgaaag actgtcctat 420 

attcctcaaa cccagaaggc ctatatgggc tcatgtggaa ggcaagagta taatgtcaca 480 

gccaataata atatgtccac cacctcccaa tatggttcac agttgccata tgcatattat 540 

aggcaggctt caagaaatga ttctgcaatc atgtccatgg aaaccaggtc tgattactgt 600 

cttccagtag ctgattacaa ctacaaccca cacccaaggg gatggagacg ccaaacatga 660 

aagttagaag aacggatcat gtgccaactg tagtttttta aaaaaaacgc tcctttgtag 72 0 

ggttttaaac ttttctaata gatttagatt cacttttggt cttggcttgt tctcataagt 780 

catttatcta agt.ttgtgtt tctgtgtgtg tgagtg 816 



<210> 62 

<211> 646 

<212> DNA 

<213> Homo sapiens 



<400> 62 

gatgggggca aaagcagatt ccttagcctg cttttctgat gagagatcgc gggggagcca 6 0 

aaggcagttt gcaaaagagg accagtctct gcgcaggagt tttggtggta aacatgctgt 12 0 

atcggctgtg ggtgggagtg cgtatcagca ccacccccgt ggaaggctac ttgatgcgga 180 

taagttccat ccactggcaa acccaaaagc tacacatgtc tacatgggaa gggctgtgcg 240 

gcacaggtgc acagcagatc tcagcgctct tcccctgatg tttctgctct tcccatgcac 300 

taaggagctg agcaaacagc aagtcagaag gcagggccat ggagggtccc gacctgcgga 360 

taaagaccta gaggaagggg gaggccttga ggcggagtcc cccaagcaga gccccaactt 420 

gtgtgtgatc ctgcgccaca acttggccag ccggcctgga cagctggctc tggttactgt 480 

gggaacaatg caaggaaggc cgttgtcaca ttcctctgag gtcaaaggca caaccttcgt 540 

cacacactca gtccctgctg gcaaagagaa agacgaggag cgtggaatcg gagacctgga 600 

gcatgcgagg gacttgagaa attcaccaac tcccttgttt tactga 646 



<210> 63 

<211> 590 

<212> DNA 

<213> Homo sapiens 



<400> 63 

agatatagaa tgattagaga aatttgaaag attttcagag tctttttttt ccaattcatg 60 

gtatgaggtc agatggttgc ttgtaagtgt tttcaaactg ctaaggagcc ttctgttgga 120 

gcttgggagt cagaaacctg ggtctcacct ggctctgtga agtactcaca tggcctttga 180 

ccttggccaa ctcacttaac cttgttgagt gaatgagagt accatggagt aagtgcttga 240 

gtgcaggcat gcctttgcct tctcagaggt tatggttaag tgaattaata tatctgaagt 300 

tcttagcatc aggtctgttt aatggttatc ccaatccaga aaatttttcc tggacagaat 360 
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acctggaagc tactcaaacc aatgcagttc ctgccaaagt ttttaaaatg gacagtgatg 420 

ttggggagaa cagaaagatt ctccgggatg aaagacctaa ttacagtcag tatactccat 480 

ttagtagatg tgataatgca tcttacaaag aaaatgtgtt tctacaaaag ctggagagaa 540 

acacaccaga tattgcagaa agatttgact gtttattact gacatattag 590 



<210> 64 

<211> 386 

<212> DNA 

<213> Homo sapiens 



<400> 64 

agttaatgtg taagagctgg cagtgctctg taaatgcaca gctgcagcca catcttgggt 60 

tgctgggaag cctttgggtc atgctacctt ccctggttca gttagctata gctaggagaa 120 

aagtgtggcc ctttggtcca gaacctggcc acctgatctc tgaatttggg cattttgaag 180 

ggactgtcct tgacaaacca tcctgggtca cgtgcagaat tcttggagga ggtagactgc 240 

aggggatggg ctggttgacg attgactttt ctcctcaagc atctaaacgt gccctaacgt 300 

cttccattat ggagctgggg ccaggtgtca gcaaactggt tcagtggcag ctggagaacc 360 

taaaaaggca ttgtgggcct ctgtag 386 



<210> 65 

<211> 615 

<212> DNA 

<213> Homo sapiens 

<400> 65 

ctgaggggtt ttaaggcgta ggagtccttc tgtttgtttg gttttgttcc cctgtagcca 60 

tttctgctgc tgtgaagaaa tgagcaggga ggtggcaagt gtgtgggtga taatgacaac 120 

taatgttttt gagcagtgag tctgagtcag gccttatgcg atgcaaggtc catgcaccgt 180 

cttattcaaa ccttccacac ctgtaagggg ttctctttaa tcaacccatt gtacaaatga 240 

gaaaaccaag acttgtgaag attttgcagg ttgcccaagg tccccataga ttaagagata 300 

aagcaagact tcaaatccag gccagtgtga ccctggagtg tgtggactca cccctgcagt 360 

acactacctc tcaccatcct tgtaaaaaga aaaagtaagt tgtcagggca agagcactga 420 

tgcaggagtc aggaactctg agagctggtc ccagctttgc cactcacttg ttacatgacc 480 

tttagcaaaa ttgcttcaag gggaaaggag ttagcattgt gtgtgcacct actatgtgtc 540 

agttgctggg ctatgggctt agaatacaat tgattatgaa atcctcaccc cattaccaca 600 

agggaaatgg gatgg 615 



<210> 66 

<211> 613 

<212> DNA 

<213> Homo sapiens 



<400> 66 

cttggtccca cagtgagaac gctttgctat cacatgggca ggctctgaga gccctgccgg 60 

cctggccttc tcaaagaaga cctgagagct tgggacccaa gcagagagga agaacagggc 120 

tcagggtgct tgctccatgc tcgctccaca cctggggctc aaccctggct ttccccggct 180 

ccctgtgtga cttcagggca ggtcccttgg gccctctggg ccttatcatc ttcatctgta 240 

acagggcgat gcctctgccg tgtctggtgg tgttgaggag ttcctgtttg tgtaagcagc 300 

tagttcagtg ccagcacgag atgggaggcc catgaagtta gcagtgcaca aaaaatagag 360 

caaagactgg atgcatttcc tgagaacaac catcactgta aagcacttta caaatccaaa 420 

gacaaccccc ggcaaaaact caaaatgaaa ctccctctcg cagagcacaa ttccaattcg 480 

ctctaaaaac attacaagtt agttcatgtc atgccagata gctgaaggca gcttcacaag 540 

ttcttaaggc caggaatgcc atgtgtctgc tatgcacagc tgggccctgg cccttgagcc 600 

ctgatgacag cac 613 
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<210> 67 

<211> 930 

<212> DNA 

<213> Homo sapiens 



<400> 67 

tttcgtcccg cgtgctgcgg tggcgatcag cgcggacacg gagcctggct ccacccgtcc 60 

gcccagcctt ggggttctgc cagagcgccg ggcgcagagc aagcgggtct gagctccagc 120 

acctttggca gctcccggcc ccggggtgcg aacggcggcc gtggctcacc gcttcaggag 180 

caggcgacca ggacactagg aagagaagcc cggaccaggg ctagtttgtt tccccaactc 24 0 

ctctggccag ctcaggttct taaacccgcc gtgcaacaga cgcaggtacc ggctcagaaa 300 

gctcagtcag tggtgcagga agcctgggtc gcggtgcatc gtgtttaaaa ataaacacac 360 

acttcccttg gggccagtgg catttattct atttaaatta catagaatcg ttccattcct _420 

ttgtgtctga aagattaagg ctgaagtctg ctttcccagc tatttgcatg tatttgttta 480 

gggatggagt cgccctgcaa cacgggctgc tcctatggca actgctacaa aacaagatca 54 0 

ggcttcacca gggtcgggag aagaaacctc ccaagaaagc caggcgctgg agtcatgggt 600 

gtggatgtcc ttttagggcc taggaaagta gggtctgatg aggaggcatt gaagaaaagc 660 

cagtatgacg acgatgcttt catggtgaag ttttgtgcat cttctactga acatttttta 720 

aacaaccccc aaacagatgc agttttgggc cagcgttatg gaagagaaag caaatccccc 780 

aaatgtctga atcacaaaca cagagacctc acccccgtga cagtggaaac aaatgccctg 840 

gcctctgacc cctacagatc cgcgttcctg cccctaaacg gtgcagtgag gggagagctg 900 

ggctgcagaa caggtggaga gaagaacgtg 930 



<210> 68 

<211> 239 

<212> DNA 

<213> Homo sapiens 



<400> 68 

tcctgatcag ggatgagtcc gtgctgttgg gtttattcat ttactttagc catcttggag 60 

ccgtgtcagc tgcagagact ggagtccggg gcatgatggg gatgtacctg aaacctagtg 120 

agagtcatcc tgactctggg gaagagccgc tcaaagatga gaaagataaa ggggtaacca 180 

gaggtggtgg tgtggtccac tgtgactcgt cagctgttgc tctgtgtgtt catttttga 239 



<210> 69 

<211> 336 

<212> DNA 

<213> Homo sapiens 



<400> 69 

cacaaaatgc aaccactgag aatttattgc attacaggaa cttggaaatt caccttctaa 60 

atatctggta agggtatgtc taactgctga atttcaccat aattcttaca tactagccaa 120 

tttctatttc atgctgttta atagggcttg ctgtgaatcc gtgctacctt tgaatttttc 180 

agcacagtaa atgtaagact tctagacatc tgcaagaagc atataagtaa aagagctcca 240 

ctgtaatgcg aatggcatta gactcatgca tgcaacttta ttcagtcttt cctttgtgtc 300 

acctcaacac atcaatggtt tgacaagtga aaaagt 336 



<210> 70 

<211> 489 

<212> DNA 

<213> Homo sapiens 
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<400> 70 

cggaattccc ggatcgacgc tttcgtacgg cctgcacctt cgcagccagc cttgcctttg 60 

attttgggcg gcacatgggc cgggtgcgtg ggcttggttt ggatggggac ggggttttgc 120 

ggcgcgcctg agttttgaca ctccaacccc accgaaagtc cggaggagcc gggtgtgctg 180 

ctcgcgtctt tgaaaggtgg aggcaggaga agtagggcaa ctggtgtggc tgcatgctga 240 

ggcacatgat ttaaaaattt caactgctga tattctttcc gaggcgcgga gctctgctgc 3 00 

tgtttcaggc tgtgtccaga cccaggaatg tggtgtgacg atcaccaatt cctccaacct 360 

ggcagcaaca tttgctgctc ctttggaatg gctgggggtg gggcacgggc ggaaaaaaac 420 

tctgtgtcct ggctacatca aaatctgttg gggtggatct tggactcagc acccagggat 480 

tctgataag 489 



<210> 71 

<211> 362 

<212> DNA 

<213> Homo sapiens 



<400> 71 

gaaaagagtt gttgactttc agggagccca aacctagggc ctccccaagc cagggttgtg 60 

agacctttga ggctctatgg ttcctagcat ttccaagctt ctgggcacaa ctgcattacc 120 

cggtacccat agtggaagcc acttgcagta tgcctggtcc agctgcagcc tcacacagag 180 

ctagcaccta tgttagcact tggagttgcc cacctcacca cagctggcat gcctggcagt 240 

gcacagtggc cagacctcac ttgcagacct ctcattgctg cacatctggc ttgcccttgg 3 00 

cagacatgga atccaggctg gttgcttcac caagtgagtg gaacaagctc acttgggccc 360 

ag 362 



<210> 72 

<211> 265 

<212> DNA 

<213> Homo sapiens 



<400> 72 

aatgagccta tagaaaaaca cacacacaca ctgaagataa tcagggtcac tgaattgaat 60 

gtgatttttg gtgagattga aattgtgcaa gtccaggcct atcagtggca agtaggagga 120 

ggagatgatc agcagtgctt tcctcctcct aacccttatc cggtcttatc tgctgctgcg 180 

gtatcaaaac accacatcga tgacagaatt ggacccacgc aggctgcact gcaccatctg 240 

gatggaagtg tagaggcctg caaag 265 



<210> 73 

<211> 390 

<212> DNA 

<213> Homo sapiens 



<400> 73 

tttcgtgtga caaaagggaa ggacaagtga gtggctgcat gaggataaaa atgccctttg 60 

gcaacagctg tattcagtga gcctcttccc cagaggatac gccctccctg cttcctggtc 120 

agagccgggg cttgaggtca gctggtggga agatgagcag ggcccccggg atcctggctt 180 

cttggagaag agccccctcc ctttctgtgc agaagggcag ccagactgct agacacacct 240 

gccacccaga ggttccgctg ggaaactgct tcttacctgt ctacaaggcc agtcccctca 300 

cagttacccg tctttgggct gaaaggtaag aggggagggt ggtgacggac tctgctggtc 360 

acttggctct tccactgagc aactcacacc 390 



<210> 74 
<211> 362 
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<212> DNA 

<213> Homo sapiens 



<400> 74 

cccatgtggt gaactgctgc ccatcctcaa aaaaaaccag ttaaacatcc ttctcttgca 60 

accagtggac acagagactt tggaagggcc tccaggcctt ggcctggatg cagagggccc 12 0 

tgagaagaga cacaggtggc agtcagtctg actatcctct cgccctgtcc tcatttccca 180 

cagctggatc ctcctgccct gccctgggat tgaccacacc tctggcttgg aggttatgtc 240 

tgatctttat cacaggaagg gaaactcgct ccatccccaa ggaaagagaa ccaaggatgc 300 

cagaaaggaa agcttccccc agaagatggg acaatttccc ttacaaagtc tggcagtcat 360 

ct 362 



<210> 75 

<211> 340 

<212> DNA 

<213> Homo sapiens 



<400> 75 

atgcttcttg ctgacaaaga ggcctctgaa gcaggcctca ccaatgttcc caacgatgca 60 

aactatccaa gatacagtcc agcagagtgc ttaatggcca taggctttgg tgtccggaag 120 

tctaggttgg agcacttgca tctgcaacag tgtgcaggtg attttctctc tgcaagcctc 180 

agcccaacta accagatatc agtatctgtg tgggcacata tccataagct ggcccgtggt 24 0 

aattcttaag tacgctgcga gcgtctatgg tatttctcta gcagaccgta agcgacaata 300 

tgacagatac tttcgatatg aacgtttacg caccataaag 340 



<210> 76 

<211> 390 

<212> DNA 

<213> Homo sapiens 



<400> 76 

tttcgtggga gacgcaagcc aggaggccag ggctcctctt cggctgcccg ctgctgagat 60 

gggcgcggca gccaaggggt cgttgtgggc ctcccctttc tctcgttcag tgctgcccct 120 

ttgtcttggg tccagtctct gactcctcgt agctgctcgc ttggaccagg ctggccgtgg 18 0 

tgacagcact gtgtcgtccc cgctcctagg cagtgggcct tccgggtagt agcggtcgag 240 

tcccgcctag gctctcagga ctccacagcc cctgagtatt tgactcaaat caccagtcca 300 

atatccctcc cgttaccagt gaggaacctg gggcccagag aagaggacgg agtcttgccc 360 

aagtgactca agaccctctt aggaacttct 390 



<210> 77 

<211> 614 

<212> DNA 

<213> Homo sapiens 



<400> 77 

tttcgtaaga aatgaagcca gagagttggc ctgggggcag accatggatc tctttggaca 60 

tgccaacaag tttgggtttt attctgattg aggagggtca tatattgtgt catttattct 120 

tcacagccac ccatgaggct aattttattc ctatttatag atgaagtgcc tgaagcttag 180 

agggggctta gttaaggtca cataactggt aagtgatagg gctggggttt gaccacctgc 240 

tctggatgcc aggatctgtc cttttcagct acagctgctc cttggaagct gccagtagaa 300 

aatctgctgg atgcttttca gtgattctgg tgtaaaactt caaaggatgc tttgggtgtt 360 

tgtgagagca gatttgaaag atggtggtgg taacaaattg ggtccaacta tataggtcct 420 

gtgagttcag ggtaaaggat gaggtgcttt tgctatagag aaccctggat ccaaaggaga 480 

ggacccgtgc agagtctgag gggaactgct ggtagttaga gtcagaaagc atgttccaca 540 
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tgggtgaaga gggtagacca caggtaaata ggggttgtag atggaggtaa ctgttggaaa 600 
tccgggaaaa tggg 614 



<210> 78 
<211> 627 
<212> DNA 

<213> Homo sapiens 
<400> 7B 

cgctccattt ttcttttccc caaatacatc taagataaaa ttgtggatat cagaaggttc 60 

ataggggtcc cccttccacc agcctgccaa ggagaattct gcagtccctg gcccagcttt 120 

cccgcctcat ccctggagag ggatgcctgt ggaaggcttg gtgattggga ctcagagaaa 180 

caggaaggga gaaaaggttt cccctccacg aattcagtca ttctacataa aaaaagaaca 240 

aaaaggagtg tgttgggagg ggacaagttt tggcaaaaaa aatctccaga aagtccacaa 300 

ggggacttcc aggacagtag gagcctcagg tccacagtga agtttcccac gtccacaagg 360 

aagttgcaag gggaagtgga aaagagcagt gttgccagaa attatggatt tgatcctctt 420 

taatgcctgt gaagatacac aaaaacatga aatttctttt caatattttt gggctgaacc 480 

acttgggtgg tggctgaaat gagaccatac atacagcaca atgcatgtct gttatattgc 540 

aaagtgtatt acctaggtct catacatccc cctagtcatc agcattaatg catgttggga 600 

ccccgaccca cacaagcaga agtaatg 627 



<210> 79 
<211> 255 
<212> DNA 

<213> Homo sapiens 
<400> 79 

cacagcccga ggtccgtggc aggaggagat gacaagcagg gagaacacca acggaataaa 60 
ctctgcagtt tccttcagat catccctgca taggtgtttc tcgaaactca tgttcagtga 120 
gaacagcaga catagactca ggacaaaccc tccgtctcct ttcaggttag gttggcccat 180 
gggctggagc gtctgcgtct tggtcggccc ggcagtgtac atgtcaagca gcgcgcaaag 240 
actccacagc cattc 255 



<210> 80 
<211> 231 
<212> DNA 

<213> Homo sapiens 
<400> 80 

aaacaacttc agttccatct gatgtagctg actttttgat gagactaaaa ctgtgtagtt 60 
catcatctcc atccaagttc agatctcaag tctatgatct tgactacctt aatgtatggc 120 
tattattatc ttgatgatca cattggcttt gctttcggca caccccgatc tctctccctt 180 
tcttctgatt ttctccactc aaaacaagag gggtattttt ctccgacatt g 231 



<210> 81 
<211> 415 
<212> DNA 

<213> Homo sapiens 



<400> 81 

tttcgttgga tagtcacttt tcaaaggtca catgtccctt gcttggaccc catcacacac 60 
aacctgctgg tagtgtctgc ctcctgacag gtgccactgg ccatgggctg tgggagccca 120 
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gaggcttcga aggacccctt cagcatttca tgcgataggc tccattcatt gtactgctcg 180 

gggctgtgag tggctgcgtc tgatcgtctc ttaatgtcca gtgtagggag cactgaggca 240 

ggcaggtctg agcaggctga aagggggatt ggtgtatcag agagagtagg agtccaggct 3 00 

caggatcgga cagagctgag tgcagattgc aacactgtga ctcccaaggt caaggcttcg 360 

tgatcttggg aggttattta aaagaggccc tatggtgtcg tggtaagagc atggg 415 



<210> 82 

<211> 291 

<212> DNA 

<213> Homo sapiens 

<400> 82 

gtcaagtgct acccgttgca ggcgcagggg ccacgcgggt ctggttggga gatcgagcct 60 

ggtcgatccg ttttgggcgg aagagtagga tttattcctg aaagtgtgga ggcgcgagcg 120 

tttgggtccg aagttgagtt gcagggagat gtgtggcctg tggaccatgg ctcttttact 180 

gcaaagttct tttgcttatt tctgaaactt acattgttta ttcgctactg ggctggagaa 240 

gccaattgta cagaatttaa accgtctctc agatgtgtac agtagaactc a 2 91 



<210> 83 

<211> 426 

<212> DNA 

<213> Homo sapiens 

<400> 83 

aaatcaaaag ctctaacagc actccaggtt gcttttacaa gccagcctgt gaagctctaa 60 

cgttctctag cgaaaccagc gaatcgccaa cttccgagaa gaaatgatgc gaaaatccct 120 

cccgcactga cacctcagta agtgaaagag acttcttttc tctctccgta acctccctcc 180 

tcctcactgc cgaatagtgc ggtgcttctt aactgagacg acgctaaact accagaagat 240 

ccttagtgtc cgtccaggaa ccaaactcgc gaccgcctcg cacgtgtcag gcctaggact 300 

acagacacca cctttcggcc ttgctcagca cctcatccgc ccccacgcct tccttgcgcc 360 

caaagacccc ctcacctcct ttacagagag aaattcaaga agcgacgaaa tcgtcgaccc 420 

gggaat 426 



<210> 84 

<211> 58 

<212> DNA 

<213> Homo sapiens 



<400> 84 

tttttttttt ttaaagtata aaattttttt ggaaaaaaag gaaaaaaatc tatataaa 58 



<210> 85 

<211> 101 

<212> PRT 

<213> Homo sapiens 



<400> 85 

Met His Cys Gin Lys Lys His Pro Tyr Trp Tyr Gin Gin Lys Ser Gly 

15 io 15 

Gin Ala Pro Val Leu Val lie Tyr Glu Asp Asn Lys Arg Pro Ser Gly 

20 25 30 

lie Pro Glu Arg Phe Ser Gly Ser Asn Leu Gly lie Met Ala Thr Leu 
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35 40 45 

He He Ser Gly Ala Gin Val Glu Asp Glu Ala Asp Tyr Tyr Cys Tyr 

50 55 60 

Ser Thr Asp Ser Ser Gly Asia His Arg Glu Val Phe Gly Gly Gly Thr 
65 70 75 80 

Arg Leu Thr Val Leu Gly Gin Pro Lys Ala Ala Pro Phe Gly His Ser 

85 90 95 

Val Pro Ala Leu Leu 
100 



<210> 86 
<211> 57 
<212> PRT 

<213> Homo sapiens 



<400> 86 
Met Arg Lys Glu Gin Glu Val Thr 

1 5 
Thr Glu Thr Val Thr Ser Ala Glu 
20 

Arg Tyr Arg Ser Lys Leu Ala Asn 
35 40 
Tyr Arg Trp Val Leu He Phe Arg 
50 55 



Lys Lys His He Cys Glu Leu Leu 

10 15 
Thr Pro Asp Ala Leu Ala Lys Ser 

25 30 
Gin Met His Phe Leu Gly Leu Lys 
45 

Met 



<210> 87 

<211> 99 

<212> PRT 

<213> Homo sapiens 



<400> 87 

Met Ala Leu Val Ala Ser Pro Ser 

1 5 
Lys Leu His Pro Lys Ala Tyr Gly 
20 

Met Leu Asp Pro Asn Thr Asn Gly 
35 40 
Pro Lys Thr Gly Val Val Gly Trp 

50 55 
Val Lys Glu Gly Met Asn He Val 

65 70 
Pro Gly Met Ala Arg Pro Ala Arg 
85 

Asn Ser Lys 



Met Gly Arg Asn Leu Lys Asp Glu 

10 15 
Ser Trp His Leu Val Pro Trp Gin 

25 30 
Ser Pro Val Phe Pro Ser Ala Leu 
45 

Gin ser Met Trp Cys Phe Gly Lys 
60 

Glu Ala Met Glu Arg Leu Trp Val 
75 80 
Arg Ser Pro Met Leu Thr Val Asp 
90 95 



<210> 88 

<211> 144 

<212> PRT 

<213> Homo sapiens 



<400> 88 

Met His Glu Val Asn Ala Lys Ser Cys Cys Arg Val Pro Cys Gly Lys 

1 5 io 15 

Arg Val Ala Arg Lys Leu Arg Glu Asn Gin Gin Ala Leu Trp Arg Pro 
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20 25 30 



Lys 


Ala Pro Glu 


Asp Val Arg Pro 


Gly Asn Gly Leu 


Glu His Arg 


Pro 




35 


40 




45 




Gly 


Gly Asp Pro 


Gly Ala Ala Glu 


Pro Asp Ala Val 


Cys Ala Ser 


Arg 




50 


55 


60 






Pro 


Ser Thr Glu 


Gin Arg His Gly 


Arg Ser His Gly 


Ala Arg Thr 


Pro 


65 




70 


75 




80 


Gly 


Lys Thr Thr 


Arg Gly Ala Phe 


Ser Pro Arg Pro 


Pro Ala Pro 


Ala 






85 


90 


95 




Phe 


Ser Ser Arg 


Leu Val Gly Leu 


Asn Pro Thr Pro 


Ala Phe Ser 


Leu 




100 




105 


110 




His 


Phe Met Ser 


Phe Pro Gin Lys 


Tyr lie Gin Lys 


Met Pro Phe 


Phe 




115 


120 




125 




Pro 


Leu Val Thr 


Leu lie Pro Gly 


Phe Ser Pro Phe 


Ser Gly Ser 


Leu 




130 


135 


140 








<210> 89 












<211> 59 












<212> PRT 












<213> Homo 


sapiens 










<400> 89 










Met 


Cys Asn Gly 


His Val Trp Leu 


His Asp Arg Tyr 


Trp Asp Ser 


Gly 


1 




5 


10 


15 




Ala 


Pro Arg Trp 


Gly Glu Glu Phe 


Ser Leu Ser Lys 


His Pro Gin 


Gly 




20 




25 


30 




Thr 


Glu Val Lys 


Ala lie Thr Tyr 


Ser Ala Met Gin 


Val Tyr Asn 


Glu 




35 


40 




45 




Glu 


Asn Pro Glu 


Val Phe Val lie 


lie Asp lie 








50 


55 










<210> 90 












<211> 78 












<212> PRT 












<213> Homo 


sapiens 










<400> 90 










Met 


Gly Gly Gly 


Gly Leu Arg Arg 


Gly Pro Glu Gly 


Glu Leu Gly 


Glu 


1 




5 


10 


15 




Gly 


Arg Lys Arg 


Lys Ser Ser Ala 


Arg Val Val Glu 


Lys Lys Ser 


Pro 




20 




25 


30 




Pro 


Pro Ser Ala 


Tyr Thr Glu Ala 


Ala Thr Arg Leu 


Phe Ala Ser 


Pro 




35 


40 




45 




Thr 


Pro Pro Ser 


Phe Ser Gly Arg 


Gly Leu Leu His 


Pro Trp Ala 


Gly 




50 


55 


60 






Thr 


Arg Tyr Phe 


Ser Pro Ser Ser 


Ser Leu His Pro 


Arg lie 




65 




70 


75 







<210> 91 

<211> 1089 

<212> PRT 

<213> Homo sapiens 
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<400> 91 



Met 


Asn 


Leu 


His 


Ser 


Thr 


Leu 


Gin 


Val 


He 


Ser 


Leu 


Gin 


Val 


Asn Asn 


1 








5 










10 










15 


Leu 


Asp 


He 


He 


Val 


He 


Pro 


Glu 


Thr 


Ser 


Val 


Glu 


Leu 


Ser 


Gly Phe 








20 










25 










30 




Leu 


Gin 


Lys 


Ser 


Phe 


Pro 


Lys 


Glu 


Lys 


Asp 


Asp 


Leu 


Ser 


Pro 


Gin Pro 






35 










40 










45 






Leu 


Met 


Thr 


Val 


Cys 


Glu 


Arg 


Ser 


Phe 


Arg 


Glu 


Gin 


Gly 


Ser 


Phe Gin 




50 










55 










60 








Ser 


Thr 


Tyr 


Glu 


Arg 


He 


Thr 


Glu 


Val 


Ala 


Val 


Glu 


He 


His 


Arg Met 


65 










70 










75 








80 


Asn 


Leu 


Val 


Leu 


Leu Arg 


Thr 


Val 


Gly 


Met 


Ala 


Asn 


Arg 


Glu 


Lys Tyr 










85 










90 










95 


Gly 


Arg 


Lys 


He 


Ala 


Thr 


Ala 


Ser 


He 


Gly 


Gly 


Thr 


Lys 


Val 


Asn Val 








100 










105 










110 




Ser 


Met 


Gly 


Ser 


Thr 


Phe 


Asp 


Met 


Asn 


Gly 


Ser 


Leu 


Gly 


Cys 


Leu Gin 






115 










120 










125 






Leu 


Met 


Asp 


Leu 


Thr 


Gin 


Asp 


Asn 


Val 


Lvs 


Asn 


Gin 


Tvr 


Val 


Val Ser 




130 










135 










140 








lie 


Gly 


Asn 


Ser 


Val Gly 


Tyr 


Glu 


Asn 


He 


He 


Ser 


Asp 


He 


Glv Tvr 


145 










150 










155 








160 


Phe 


Glu 


Ser 


Val 


Phe 


Val 


Arcr 


Met 


Glu 


Asp 


Ala 


Ala 


Leu 


Thr 


Glu Ala 










165 










170 










17^ 


Leu 


Ser 


Phe 


Thr 


Phe 


Val 


Glu 


Arci 


Ser 


Lys 


Gin 


Glu 


Cys 


Phe 


Leu Asn 

jjwu null 








180 










185 










190 




Leu 


Lys 


Met 


Ala 


Ser 


Leu 


His 


Tvr 


Asn 


His 


Ser 


Ala 


Lys 


Phe 








195 










200 










205 






Glu 


Leu 


Thr 


Leu 


Ser 


Met 


Asp 


Glu 


Leu 


Glu 


Glu 


Asn 


Phe 


7i vet 


Glv M^t 
w j_ y nc l 




210 










215 










220 








Leu 


Lys 


Ser 


Ala 


Ala 


Thr 


Lys 


Val 


Thr 


Thr 


Val 


Leu 


Ala 


Thr 


Lys Thr 


225 










230 










235 








0 ACS 


Ala 


Glu 


Tyr 


Ser 


Glu 


Met 


Val 


Ser 


Leu 


Phe 


Glu 


Thr 


Pro 




Lys Thr 










245 










250 










255 


Arg 


Glu 


Pro 


Phe 


He 


Leu 


Glu 


Glu 


Asn 


Glu 


He 


Tvr 


Gly 


Phe 


Asp Leu 








260 










265 










270 




Ala 


Ser 


Ser 


His 


Leu 


Asp 


Thr 


Val 


Lvs 


Leu 


He 


Leu 


Asn 


He 


Asn He 






275 










280 










285 






Glu 


Ser 


Pro 


Val 


Val 


Ser 


He 


Pro 


Arcr 


Lvs 


Pro 


Gly 


Ser 


Pro 


Glu Leu 




290 










295 










300 








Leu 


Val 


Gly 


His 


Leu Gly 


Gin 


He 


Phe 


He 


Gin 


Asn 


Phe 


Val 


Ala Gly 


305 










310 










315 








320 


Asp 


Asp 


Glu 


Ser 


Arg 


Ser 


Asp 


Arcr 


Leu 


Gin 


Val 


Glu 


He 


Lys 


Asp lie 










325 










330 










335 


Lys 


Leu 


Tyr 


Ser 


Leu 


Asn 


Cys 


Thr 


Gin 


Leu 


Ala 


Gly 


Arcr 


Glu 


Ala Val 








340 










345 










350 




Gly 


Ser 


Glu 


Gly 


Ser Arg 


Met 


Phe 


Cys 


Pro 


Pro 


Ser 


Gly 


Ser 


Gly Ser 






355 










360 










365 






Ala 


Asn 


Ser 


Gin 


Glu 


Glu 


Ala 


His 


Phe 


Thr 


Arg 


His 


Asp 


Phe 


Phe Glu 




370 










375 










380 








Ser 


Leu 


His 


Arg 


Gly Gin 


Ala 


Phe 


His 


He 


Leu 


Asn 


Asn 


Thr 


Thr He 


385 










390 










395 








400 


Gin 


Phe 


Lys 


Leu 


Glu' 


Lys 


He 


Pro 


He 


Glu 


Arg 


Glu 


Ser 


Glu 


Leu Thr 










405 










410 










415 


Phe 


Ser 


Leu 


Ser 


Pro Asp 


Asp 


Leu 


Gly 


Thr 


Ser 


Ser 


He 


Met 


Lys He 








420 










425 










430 




Glu 


Gly 


Lys 


Phe 


Val 


Asn 


Pro 


Val 


Gin 


val 


Val 


Leu 


Ala 


Lys 


His Val 






435 










440 










445 






Tyr 


Glu 


Gin 


Val 


Leu 


Gin 


Thr 


Leu 


Asp 


Asn 


Leu 


Val 


Tyr 


Ser 


Glu Asp 




450 










455 










460 








Leu 


Asn 


Lys 


Tyr 


Pro 


Ala 


Ser 


Ala 


Thr 


Ser 


Ser 


Pro 


Cys 


Pro 


Asp Ser 


465 










470 










475 








480 


Pro 


Leu 


Pro 


Pro 


Leu 


Ser 


Thr 


Cys 


Gly 


Glu 


Ser 


Ser 


Val 


Glu 


Arg Lys 










485 










490 










495 


Glu 


Asn 


Gly 


Leu 


Phe 


Ser 


His 


Ser 


Ser 


Leu 


Ser 


Asn 


Thr 


Ser 


Gin Lys 
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500 










505 










510 




Ser 


Leu 


Ser 


Val 


Lys 


Glu 


Val 


Lvs 


Ser 


Phe 


Thr 


Gin 


He 


Gin 


Ala Thr 






515 










520 










525 






Phe 


Cys 


He 


Ser 


Glu 


Leu 


Gin 


Val 


Gin 


Leu 


Ser 


Gly Asp 


Leu 


Thr Leu 




530 










535 










540 








Gly 


Ala 


Gin 


Gly 


Leu 


Val 


Ser 


Leu 


LVS 


Phe 


Gin 


Asp Phe 


Glu 


Val Glu 


545 










550 










555 








560 


Phe 


Ser 


Lys 


Asp 


His 


Pro 


Gin 


Thr 


Leu 


Ser 


He 


Gin 


He 


Ala 


Leu His 










565 










570 










575 


Ser 


Leu 


Leu 


Met 


Glu 


Asp 


Leu 


Leu 


Glu 


Lvs 

2 *■* 


Asn 


Pro Asp 


Ser 


Lvs Tvr 








580 










585 










590 




Lys 


Asn 


Leu 


Met 


Val 


Ser 


Arq 


Gly 


Ala 


Pro 


Lvs 


Pro 


Ser 


Ser 


Leu Al a. 






595 










600 










605 






Gin 


Lys 


Glu 


Tyr 


Leu 


Ser 


Gin 


Ser 


Cys 


Pro 


Ser 


Val 


Ser 


Asn 


Val Glu 




610 










615 










620 








Tyr 


Pro 


Asp 


Met 


Pro 


Arg 


Ser 


Leu 


Pro 


Ser 


His 


Met 


Glu 


Glu 


Ala Pro 


625 










630 










635 








640 


Asn 


Val 


Phe 


Gin 


Leu 


Tvr 


Gin 


Arcr 


Pro 


Thr 


Ser 


Ala 


Ser 


Arcr 


r iVS T iV 1 ; 










645 










650 










655 


Gin 


Lys 


Glu 


Val 


Gin 


Asp 


Lvs 


Asp 


Tyr Pro 


Leu 


Thr 


Pro 


Pro 


Pro Ser 








660 










665 










670 




Pro 


Thr 


Val 


Asp 


Glu 


Pro 


Lvs 


He 


Leu 


val 


Glv 


Lys 


Ser 


Lys 


Phe A^d 






675 










680 










685 






Asp 


Ser 


Leu 


Val 


His 


He 


Asn 


He 


Phe 


Leu 


Val 


Asp 


Lys 


T ,\rc 
uy o 


UlD XT X \J 




690 










695 










700 








Glu 


Phe 


Ser 


Ser 


Ser 


Tvr 


Asn 


Arcr 


Val 


Asn 




Ser 


He 


Asp 


Val A«;r> 
v ax nop 


705 










710 










715 








720 


Phe 


Asn 


Cvs 


Leu 


Asp 


Val 


Leu 


He 


Thr 


Leu 


Gin 


Thr Trp 


V dl 


Val T1p* 










725 










730 












Leu 


Asp 


Phe 


Phe 


Glv 


He 


Glv 


Ser 


Thr 


Ala 


Asp 


Asn 


His 


Ala 


Met Arg 








740 










745 










750 


Leu 


Pro 


Pro 


Glu 


Glv 


He 


Leu 


His 


Asn 


Val 


Lys 


Leu 


Glu 


Pro 


Hi <5 Al a 






755 










760 










765 






Ser 


Met 


Glu 


Ser 


Glv 


Leu 


Gin 


Asp 


Pro 


Val 


Asn 


Thr 


Lys 


Leu 


XJC. Li 




770 










775 










780 








Lys 


Val 


His 


Ser 


Leu 


Ser 


Leu 


Val 


Leu 


Asn 


Lys 


Thr 


Thr 


Ser 




785 










790 










795 








800 


Ala 


Lys 


Ala 


Asn 


Val 


Ser 


Lys 


Leu 


Val 


Ala 


His 


Leu 


Glu 


Met 


Tie Rln 










805 










810 










815 


Gly 


Asp 


Leu 


Ala 


Leu 


Gin. Gly 


Ser 


He Gly 


Ser 


Leu 


Ser 


Leu 


Ser Asp 








820 










825 










830 




Leu 


Thr 


Cys 


His 


Gly 


Glu 


Phe 


Tvr 


Arg Glu 


Arcr 


Phe 


Thr 


Thr 


Ser Gly 






835 










840 










845 






Glu 


Glu 


Ala 


Leu 


He 


Phe 


Gin 


Thr 


Phe Lys 


Tvr 

2 1 - 


Gly Arg 


Pro 






850 










855 










860 








Leu 


Leu 


Arg 


Arg 


Glu 


His Asp 


He 


Arg Val 


Ser 


Leu Arg 


Met 


Ala Ser 


865 










870 










875 








880 


Val 


Gin 


Tyr 


Val 


His 


Thr 


Gin 


Arcr 


Phe 


Gin 


Ala 


Glu Val 


Val 


Ala Phe 










885 










890 










895 


lie 


Gin 


His 


Phe 


Thr 


Gin 


Leu 


Gin 


Asp Val 


Leu 


Gly Arg 


Gin 


Arg Ala 








900 










905 










910 




Ala 


He 


Glu 


Gly 


Gin 


Thr 


Val 


Arg 


Asp 


Gin 


Ala 


Gin Arg 


Cvs 

2 " 


Ser Arg 






915 










920 










925 






Val 


Leu 


Leu 


Asp 


lie 


Glu Ala 


Gly 


Ala 


Pro 


Val 


Leu 


Leu 


He 


Pro Glu 




930 










935 










940 








Ser 


Ser 


Arg 


Ser 


Asn 


Asn 


Leu 


He 


Val 


Ala 


Asn 


Leu Gly 


Lvs 


Leu Lys 


945 










950 










955 








960 


Val 


Lys 


Asn 


Lys 


Phe 


Leu 


Phe 


Ala 


Gly Phe 


Pro 


Gly Thr 


Phe 


Ser Leu 










965 










970 










975 


Gin 


Asp 


Lys 


Glu 


Ser 


Val 


Pro 


Ser 


Ala 


Ser 


Pro 


Thr Gly 


He 


Pro Lys 








980 










985 










990 


His 


Ser 


Leu 


Arg 


Lys 


Thr 


Thr 


Ser 


Thr 


Glu 


Glu 


Pro Arg 


Gly 


Thr His 






995 








1000 








1005 






Ser 


Gin 


Gly 


Gin 


Phe 


Thr 


Met 


Pro 


Leu Ala 


Gly 


Met 


Ser 


Leu 


Gly Ser 
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1010 1015 1020 

Leu Lys Ser Glu Phe Val Pro Ser Thr Ser Thr Lys Gin Gin Gly Pro 
1025 1030 1035 1040 

Gin Pro Thr Leu Ser Val Gly Gin Glu Ser Ser Ser Pro Glu Asp His 
1045 1050 1055 



val 


Cys 


Leu 


Leu 


Asp Cys Val Val Val 


Asp Leu Gin Asp His Gly Pro 






1060 




1065 




1070 


Ser 


Leu Leu Arg 


Arg Asp He Arg Arg 


Asp Tyr Ser Lys Gly Thr Arg 




1075 






1080 


1085 


Gly 


















<210> 


92 












<211> 


182 












<212> : 


PRT 












<213> Homo 


sapiens 








<400> 


92 










Met 


Ser 


Asn 


Glu 


Thr 


Val Leu Leu Ala 


Lys His Asn 


He Phe Thr Leu 


1 












10 


15 


Ala 


Leu 


Met 


He 


VaX 


ei.0XX JjcU -fllti iipll 


Met Phe He 


Thr Tyr Gly Asp 








20 




M 3 




30 


Thr 


Phe 


Leu 


Pro 


Thr 


Pro Ser Ser Tyr 


Asp Glu Leu 


Tyr Tyr Glu He 






35 






40 




45 


He 


Arg 


Met 


His 


Gin 


Ser Phe Asp Asn 


Leu Tyr Ser 


Met Gly Leu Arg 




50 








55 


60 




Leu 


Ser 


Thr 


Asn 


Ala 


Gly Gin Trp Lys 


Glu Ala Ala 


Ser Lys Val Thr 


65 










70 


75 


80 


His 


Ala 


Leu 


Val 


Asn 


He Arg Ala He 


He Asn His 


Phe Asn Pro Lys 










85 




90 


95 


He 


Glu 


Ser 


Tyr 


Ala 


Ala Val Asn His 


He Ser Gin 


Leu Ser Glu Glu 








100 




105 




110 


Gin 


Val 


Leu 


Glu 


Gly 


Val Arg Ala Asn 


Tyr Asp Thr 


Leu Thr Leu Lys 






115 






120 




125 


Leu 


Gin 


Asp 


Gly 


Leu 


Asp Gin Tyr Glu 


Arg Tyr Ser 


Glu Gin His Lys 




130 








135 


140 




Glu 


Ala 


Ala 


Phe 


Phe 


Lys Glu Leu Val 


Arg Ser He 


Ser Thr Asn Val 


145 










150 


155 


160 


Arg 


Arg 


Asn 


Leu 


Ala 


Phe His Thr Leu 


Ser Gin Glu 


Val Leu Leu Lys 










165 




170 


175 


Glu 


Phe 


Ser 


Thr 


He 


Ser 












180 












c210> 93 












<211> 255 












<212> PRT 












<213> Homo 


sapiens 








<400> 93 










Met 


Ser Arg Asn 


Ser Arg Lys Thr He 


Gly Lys Gly 


Arg Val Asp Tyr 


1 








5 




10 


15 


He 


He 


He 


Lys 


Pro 


Leu Pro Gly Tyr 


Ser Cys Asp 


Met Lys Ser Ser 








20 




25 




30 


Phe 


Ser 


Lys 


Tyr 


Trp Lys Pro Arg He 


Pro Leu Asp 


Val Gly His Arg 






35 






40 




45 


Gly 


Ala Gly Asn 


Ser 


Thr Thr Thr Ala 


Gin Leu Ala 


Lys Val Gin Glu 




50 








55 


60 




Asn 


Thr 


He 


Ala 


Ser Leu Arg Asn Ala 


Ala Ser His 


Gly Ala Ala Phe 



49 



WO 01/74836 



PCT/US01/10472 



65 70 75 80 

Val Glu Phe Asp Val His Leu Ser Lys Asp Phe Val Pro Val Val Tyr 

85 90 95 

His Asp Leu Thr Cys Cys Leu Thr Met Lys Lys Lys Phe Asp Ala Asp 

100 105 110 

Pro Val Glu Leu Phe Glu lie Pro Val Lys Glu Leu Thr Phe Asp Gin 

115 120 125 

Leu Gin Leu Leu Lys Leu Thr His Val Thr Ala Leu Lys Ser Lys Asp 

130 135 140 

Arg Lys Glu Ser Val Val Gin Glu Glu Asn Ser Phe Ser Glu Asn Gin 
145 150 155 160 

Pro Phe Pro Ser Leu Lys Met Val Leu Glu Ser Leu Pro Glu Asp Val 

165 170 175 

Gly Phe Asn He Glu He Lys Trp He Cys Gin Gin Arg Asp Gly Met 

180 185 190 

Trp Asp Gly Asn Leu Ser Thr Tyr Phe Asp Met Asn Leu Phe Leu Asp 

195 200 205 

He He Leu Lys Thr Val Leu Glu Asn Ser Gly Lys Arg Arg He Val 

210 215 220 

Phe Ser Ser Phe Asp Ala Asp He Cys Thr Met Val Arg Gin Lys Gin 
225 230 235 * 240 

Asn Lys Tyr Pro He Leu Phe Leu Thr Gin Gly Lys Cys Val Asp 
245 250 255 



<210> 94 
<211> 49 
<212> PRT 
<213> Homo 



sapiens 



<400> 94 
Met Arg Gly Asn Tyr Glu Thr Leu 

1 5 
Lys Pro Glu Val Leu Ser Gin He 
20 

Trp Arg Arg Pro Gly Pro Lys He 
35 40 

Pro 



Val Ser Leu Asp Tyr Ala He Ser 

10 15 
Glu Gin Gly Lys Glu Pro Cys Asn 

25 30 
Pro Asp Val Pro Val Asp Pro Ser 
45 



<210> 95 

<211> 308 

<212> PRT - 

<213> Homo sapiens 





<400> : 


95 














Met 


Gly 


Ser 


Ser 


Gly 


Leu 


Gly Lys Ala 


Ala 


Thr Leu Asp Glu 


Leu Leu 


1 








5 






10 




15 


Cys 


Thr 


Cys 


He 


Glu 


Met 


Phe Asp Asp 


Asn 


Gly Glu Leu Asp 


Asn Ser 








20 






25 




30 




Tyr 


Leu 


Pro 


Arg 


He 


Val 


Leu Leu Met 


His 


Arg Trp Tyr Leu 


Ser Ser 






35 








40 




45 




Thr 


Glu 


Leu 


Ala 


Glu 


Lys 


Leu Leu Cys 


Met 


Tyr Arg Asn Ala 


Thr Gly 




50 










55 




60 




Glu 


Ser 


Cys 


Asn 


Glu 


Phe 


Arg Leu Lys 


lie 


Cys Tyr Phe Met 


Arg Tyr 


65 










70 






75 


80 


Trp 


He 


Leu 


Lys 


Phe 


Pro 


Ala Glu Phe 


Asn 


Leu Asp Leu Gly 


Leu He 










85 






90 




95 


Arg 


Met 


Thr 


Glu 


Glu 


Phe 


Arg Glu Val 


Ala 


Ser Gin Leu Gly 


Tyr Glu 
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100 




105 


Lys 


His 


Val 


Ser 


Leu 


He Asp lie Ser 






115 






120 


Met 


Arg 


Arcr 


Val 


Thr 


Gin Arg Lys Lys 




130 








135 


Cys 


Leu 


Leu 


Phe 


Asp 


His Leu Glu Pro 


145 










150 


Thr 


Phe 


Leu 


Glu 


His 


Lys Ser Phe Arg 










165 




Gin 


Ser 


Tyr 


Val 


He 


His Gly Cys Leu 








180 




185 


Arg 


Ser 


He 


Ala 


Leu 


Phe Asn Gly He 






195 






200 


Val 


Leu 


Ser 


Lys 


Pro 


Thr Pro Gin Gin 




210 








215 


Phe 


He 


Asn 


Val 


Ala 


Lys Lys Leu Leu 


225 










230 


Leu 


He 


Ala 


He 


Ala 


Glv Spr Pro Gin 










245 




Ser 


Lys 


Gly 


Pro 


He 


Pro His Leu Ser 








260 




265 


Glu 


Cys 


Glu 


Met 


Thr 


Glu Val Gly Leu 






275 






280 


lie 


Thr 


Ala 


Ser 


Pro 


Leu Pro Thr Ala 




290 








295 


Leu 


Glu 


Tyr 


Thr 







305 



<210> 96 
<211> 217 
<212> PRT 

<213> Homo sapiens 





<400> ! 


96 






Met 


Gly 


Val 


Pro 


Val 


Thr Ser Gin Glu 


1 








5 




Pro 


Leu 


Cys 


Leu 


Ala 


Pro Tyr Phe Arg 








20 




25 


Arg 


Gin 


His 


Asn 


Gly 


Asn Leu Glu Glu 






35 






40 


Pro 


He 


Phe 


Leu 


Thr 


Asp Leu Glu Pro 




50 








55 


Ser 


Ala 


Lys 


Glu 


Arg 


Ser Phe Met Cys 


65 










70 


Asn 


Trp 


Phe 


Arg 


Glu 
85 


He Val Asn Ala 


Glu 


Met 


Lys 


Gly 


Lys 


Val Leu Thr Arg 








100 




105 


Gin 


He 


He 


Leu 


Glu 


Lys Tyr Leu Ala 






115 






120 


Pro 


Leu 


Gly 


Asn 


Phe 


Asp Val Glu Thr 




130 








135 


Val 


Thr 


Ala 


He 


Ser 


Ala Lys He Arg 


145 










150 


Thr 


Leu 


Lys 


Glu 


Arg 


Cys Leu Gin Val 










165 




Glu 


Asn 


Tyr 


Arg 


Arg 


Leu Asp He val 








180 




185 


Glu 


Asp 


His 


Pro 


Asn 


Val Gin Lys Asp 






195 






200 


Arg 


He 


Ala 


His 


Gin 


Arg Met Gly Asp 



PCT/US01/10472 

110 



Ser He 


Pro Ser Tyr Asp 


Trp 




125 




Val Ser 


Lys Lys Gly Lys 


Ala 




140 




He Glu 


Leu Ala Glu His 


Leu 


155 




160 


Arg He 


Ser Phe Thr Asp 


Tyr 


170 


175 




Glu Asn 


Asn Pro Thr Leu 


Glu 




190 




Ser Lys 


Trp Val Gin Leu 


Met 




205 




Arg Ala 


Glu Val He Thr 


Lvs 




220 




Gin Leu 


Lys Asn Phe Asn 


Asn 


235 




240 


Val He 


Gly Pro Phe Gin 


Ala 


250 


255 




Ser Glu 


Val Tyr Lys Glu 


Leu 




270 




Leu Gin 


Arg Ala He Thr 


Ala 




285 




Met Ala 


Ser Lys Ser Pro 


Ser 




300 




Ser Gly 


Gin Lvs Leu Val 


Ser 


10 


15 




Leu Leu 


Lys Leu Cys Val 


Glu 




30 




He Asp 


Gly Leu Leu Asn 


Cvs 




45 




Gly Glu 


Lys Leu Glu Ser 


Met 




60 




Ser Leu 


He Phe Leu Thr 


Leu 


75 




80 


Phe Cys 


Gin Glu Thr Ser 


Pro 


90 


95 




Leu Lys 


His He Val Glu 


Leu 




110 




Val Thr 


Pro Asp Tyr Val 


Pro 




125 




Leu Asp 


He Thr Pro His 


Thr 




140 




Lys Lys 


Gly Lys He Val 


Tyr 


155 




160 


Val Arg 


Ser Leu Val Lys 


Pro 


170 


175 




Arg Ser 


Leu Tyr Glu Asp 


Leu 




190 




Leu Glu 


Arg Leu Thr Gin 


Glu 



205 
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210 215 



<210> 97 

<211> 265 

<212> PRT 

<213> Homo sapiens 





<400> 97 






Met 


Pro 


Ala 


Pro Arg Ala 


Arcr Glu Gin Pro Arcr Val Pro 


Glv Glu Arcr 

\JlJr VJ J.U AJ.y 


l 






5 


10 


15 


Gin 


Pro 


Leu 


Leu Pro Arg 


(11 v Qla Arcr Gl \r Pr<*> Arer Arcr 

vjijr £-a_ia y \7ijr xriu .rny ^u.y 


Ai F Aiy Aiy 








20 


25 


30 


Ala 


Ala 


Gly 


Ala Ala Val 


T^n T.^u V^l Rln Mf=>1- Ti(=»n ftJl ii 


ni.^ Alcl Alct 






35 




40 45 




Phe 


Phe 


Glv 


Val Thr Ala 


Aoli lieu, val Dt=Ll lyi JLicLl Aoll 


OCl 1U1 A tail 




50 






SO D L> 






Asn 


Trn 


Thr Gly Glu 


ijjlii Aia inr Arg Aia ax a Lieu 


vai jfne Jueu 








70 




o u 




Ala 


Ser 


Vyic Leu Leu 


Ala FiO V ell Vjiy V^iy Jlip IicU 


Ala. ASp Val 








85 






Tyr 


Leu 


Glv 


Arcr Tvr Arcr 


Ala Val Ala JjcU bSl JLieil .ueu 


lieu xyr .ueu 








100 


J. U D 


11U 


AT P5 

i-i-Lct 


Ala 


Ser 


Gly Leu Leu 


riO Ala I ill Ala. irllci rlU AoJU 


uly Aiy oci 






115 




120 125 




Ser 


Phe 


Cys 


Gly Glu Met 


Pro Ala Ser Pro Leu Gly Pro 


Ala Cys Pro 




130 






135 140 




Ser 


Ala 


Gly 


Cys Pro Arg 


Ser Ser Pro Ser Pro Tyr Cys 


Ala Pro Val 


145 






150 


155 


160 


Leu 


Tyr 


Ala 


Gly Leu Leu 


Leu Leu Gly Leu Ala Ala Ser 


Ser Val Arg 








165 


170 


175 


Ser 


Asn 


Leu 


Thr Ser Phe 


Gly Ala Asp Ala Gly Asp Gly 


Ser Arg Pro 








180 


185 


190 


Arg 


Arg 


His 


Pro Pro Leu 


Leu Gin Leu Val Leu Leu Glu 


His Gin Pro 






195 




200 205 




Gly 


Cys 


Cys 


Ala Val Ala 


Ala Gly Gly Gly Val Tyr Ser 


Ala Glu His 




210 






215 220 




Gin 


Leu 


Pro 


Ala Gly Leu 


Gin His Pro Cys Gly Leu Cys 


Gly Pro Gly 


225 






230 


235 


240 


lie 


Phe 


His 


Leu Pro Leu 


Cys His Pro Arg Leu His His 


Gin Ala Pro 








245 


250 


255 


Asp 


Gly 


Gin 


Pro Ser Val 


Leu Tyr Ala 










260 


265 





<210> 98 

<211> 111 

<212> PRT 

<213> Homo sapiens 



<400> 98 



Met 


He 


Lys 


Tyr Asn Glu 


Gly Arg Ala Glu Gly Leu Ser 


Leu 


Ala 


Val 


1 






5 


10 




15 




Pro 


Gly Cys 


Cys Gly Gly 


Gly Arg Leu Ala Thr Gin Thr 


Val 


Ala 


Ala 








20 


25 


30 






Ala 


Ala 


Ala 


Ser Trp Trp 


Pro Pro Cys Trp Arg Gly Ser 


Leu Ala 


Phe 






35 




40 45 








Ser 


Val 


Ala 


Ala Ser Ala 


Val Ser Phe Ala Phe Ser Leu 


Ala 


Ser 


Leu 




50 






55 60 








Ala 


Val 


Ser Thr Gly Gly 


Leu Glu Gly Ala Ser Pro Cys 


Ser 


Leu Ala 



52 



WO 01/74836 



PCT/US01/10472 



65 70 75 80 



Lys 


He 


Tyr 


Ser 


Lys 


Pro 


Phe 


He 


Glv 


Leu 


Glv 


His 


Val 


Ala 


Phe 


Glu 










85 










90 










95 




Pro 


Gly Lys 


Thr 


Lys Phe 


Leu 


Gly 


Gin 


Leu 


Leu 


Glu 


Thr 


Pro 


Asn 










100 










105 










110 








<210> 99 




























<211> 421 




























<212> PRT 




























<213> Homo 


sapiens 
























<400> 99 


























Met 


Ala 


Met 


Ala 


Thr 




Gly 


Gly 


Lys 


Gly 


Lys 


Gin 


Ala 


Ala 


Pro 


Lvs 


1 








c 










10 










15 




Gly 


Arci 


Glu 


Ala 




Arg 


Ser 


Gin 


Arg 


Arg 


Glu 


Ser 


Glu 


Glv 


Ser 


Val 


















25 










30 






Asp 


Cys 


Pro 


Thr 


eu 


pi ,, 


Phe 


Glu 


Tyr 


Gly 


Asp 


Ala 


Asp 


Gly 


His 


Ala 
















40 










45 








Ala 


Glu 


Leu 


Ser 


L3.L U 


Leu 


Tyr 


Ser 


Tvr 


Thr 


Glu 


Asn 


Leu 


Glu 


Phe 


Thr" 




50 










55 










60 










Asn 


Asn 






Cys 


fne 


Glu 


Glu 


Asp 


Phe 


LVS 


Thr 


Gin 


Val 


Gin 


Glv 


65 










1 u 










75 










80 


Lys 


Glu 


Tv-n 


Leu 




Leu 


Glu 


Glu 


Asp 


Ala 


Gin 


Lys 


Ala 


Tvr 


He 


Met 










Q R 










90 










95 




Gly 


Leu 


Leu 


Asp 


Arg 


Leu 


Glu 


Val 


Val 


Ser 


Arcx 


Glu 


Arcr 


Arcr 


Leu 


Lys 








100 










105 










110 






Ala 


Ala 


Arcr 


Ala 


Val 


Jj6U 


Tvr 


Leu 


Ala 


Gin 


Glv 


Thr 


Phe 


Glv 


Glu 


Cys 






115 










120 










125 








Asp 


Ser 


Glu 


Val 


Asp 


Val 


Leu 


His 


Trp 


Ser 


Ara 


Tvr 


Asn 


Cvs 


Phe 


Leu 




130 










135 










140 










Leu 


Tyr 


Gin 


Met 


Glv 


Thr 


Phe 


Ser 


Thr 


Phe 


Leu 


Glu 


Leu 


Leu 


His 


Met 


145 










150 










155 










160 


Glu 


He 


Asp 


Asn 


Ser 


Gin 


Ala 


Cvs 


Ser 


Ser 


Ala 


Leu 


Arcr 


Lys 


Pro 


Ala 










165 










170 










175 




Val 


Ser 


He 


Ala 


Asp 


Ser 


Thr 


Glu 


Leu 


Arcr 


Val 


Leu 


Leu 


Ser 


Val 


Met 








180 










185 










190 






Tyr 


Leu 


Met 


Val 


Glu 


Asn 


He 


Arg 


Leu 


Glu 


Arcr 


Glu 


Thr 


Asp 


Pro 


Cys 






195 










200 










205 








Gly 


Trp 


Arg 


Thr 


Ala 


Ara 


Glu 


Thr 


Phe 


Arcr 


Thr 


Glu 


Leu 


Ser 


Phe 


Ser 




210 










215 










220 










Met 


His 


Asn 


Glu 


Glu 


Pro 


Phe 


Ala 


Leu 


Leu 


Leu 


Phe 


Ser 


Met 


Val 


Thr 


225 










230 










235 










240 


Lys 


Phe 


Cys 


Ser 


Gly 


Leu 


Ala 


Pro 


His 


Phe 


Pro 


He 


Lys 

2 


Lys 

2 


Val 


Leu 










245 










250 










255 




Leu 


Leu 


Leu 


Trp 


Lys 


Val 


Val 


Met 


Phe 


Thr 


Leu 


Gly 


Gly 


Phe 


Glu 


His 








260 










265 










270 






Leu 


Gin 


Thr 


Leu 


Lys 


Val 


Gin 


Lys 


Arg 


Ala 


Glu 


Leu 


Gly 


Leu 


Pro 


Pro 






275 










280 










285 








Leu 


Ala 


Glu 


Asp 


Ser 


lie 


Gin 


Val 


Val 


Lys 


Ser 


Met 


Arg 


Ala 


Ala 


Ser 




290 










295 










300 










Pro 


Pro 


Ser 


Tyr 


Thr 


Leu 


Asp 


Leu 


Gly 


Glu 


Ser 


Gin 


Leu 


Ala 


Pro 


Pro 


305 










310 










315 










320 


Pro 


Ser 


Lys 


Leu 


Arg 


Gly 


Arg 


Arg 


Gly 


Ser 


Arg 


Arg 


Gin 


Leu 


Leu 


Thr 










325 










330 










335 




Lys 


Gin 


Asp 


Ser 


Leu 


Asp 


He 


Tyr 


Asn 


Glu 


Arg 


Asp 


Leu 


Phe 


Lys 


Thr 








340 










345 










350 






Glu 


Glu 


Pro 


Ala 


Thr 


Glu 


Glu 


Glu 


Glu 


Glu 


Ser 


Ala 


Gly 


Asp 


Gly 


Glu 






355 










360 










365 








Arg 


Thr 


Leu 


Asp 


Gly 


Glu 


Leu 


Asp 


Leu 


Leu 


Glu 


Gin 


Asp 


Pro 


Leu 


Val 




370 










375 










380 










Pro 


Pro 


Pro 


Pro 


Ser 


Gin 


Ala 


Pro 


Leu 


Ser 


Ala 


Glu 


Arg 


Val 


Ala 


Phe 
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385 390 395 400 

Pro Lys Gly Leu Pro Trp Ala Pro Lys Val Arg Gin Lys Gly Thr Leu 

405 410 415 

Ser Thr Ser Trp Arg 
420 



<210> 100 

<211> 290 

<212> PRT 

<213> Homo sapiens 



<400> 100 



Met 


Pro 


Phe 


Asp Phe Arg 


Arg Phe Asp He 


Tvtt Arg TiV^? 


Val Prn T^vc* 

V CLX f XW XJyO 


1 






5 


10 




13 


Asp 


Leu 


Thr 


Gin Pro Thr 


Tvr Thr* Glv Ala 


He Tl r <;pr 

XX w XXC JCL 


lie v#ya Lyb 








2 0 








Cvs 


Leu 


Phe 


He Leu Phe 


Leu Phe Leu Ser 


VJXU JJCU JLl.Lt. 


Hi \r p>ip Tip 
oxy ir lit: lie 






35 




40 




Thr 


Thr 


Glu 


Val Val Asn 


Glu TiPU T\rr Val 


M Hop XT X (J 


Asp x*ys Asp 




50 






55 


D U 




Ser 


Glv 


Glv 


Lvs He Asd 


Val Spit Leu Aqtj 

VO.X iJCX J-ICU flOil 


11" OCX UcU 


rlU Aoll iicli 


65 






70 




/ D 


BO 


His 


Cvs 


Glu 


Leu Val Gly 


Leu Asd Hp C?l n 


V31 u IMC: L. 


vjiy fity mo 














3D 


Glu 


Val 


Glv 


His He Asp 


Asn. Ser Met Lvs 


Tip "P>~o T.pii 


Aesrt fieri f!l ^/ 

rtoii noil vjxy 








100 


1U D 




Tin 


Ala 


Gly 




i-ixy riisz uiu 


Laxy ox I J. file oej_ 


xxe /isn J-»ys 


Val Pro Gly 
















Asn 


Phe 


His 


Val Ser Thr 


His Ser Ala Thr 


Ala Gin Pro 


Gin Asn Pro 




130 






135 


140 




Asp 


Met 


Thr 


His Val He 


His Lys Leu Ser 


Phe Gly Asp 


Thr Leu Gin 


145 






150 




155 


160 


Val 


Gin 


Asn 


He His Gly 


Ala Phe Asn Ala 


Leu Gly Gly 


Ala Asp Arg 








165 


170 




175 


Leu 


Thr 


Ser 


Asn Pro Leu 


Ala Ser His Asp 


Tyr He Leu 


Lys He Val 








180 


185 




190 


Pro 


Thr 


Val 


Tyr Glu Asp 


Lys Ser Gly Lys 


Gin Arg Tyr 


Ser Tyr Gin 






195 




200 


205 




Tyr 


Thr 


Val 


Ala Asn Lys 


Glu Tyr Val Ala 


Tyr Ser His 


Thr Gly Arg 




210 






215 


220 




He 


He 


Pro 


Ala He Trp 


Phe Arg Tyr Asp 


Leu Ser Pro 


He Thr Val 


225 






230 




235 


240 


Lys 


Tyr 


Thr 


Glu Arg Arg 


Gin Pro Leu Tyr 


Arg Phe He 


Thr Thr He 








245 


250 




255 


Cys 


Ala 


He 


He Gly Gly 


Thr Phe Thr Val 


Ala Gly He 


Leu Asp Ser 








260 


265 




270 


Cys 


He 


Phe 


Thr Ala Ser 


Glu Ala Trp Lys 


Lys He Gin 


Leu Gly Lys 






275 




280 


285 




Met 


His 














290 













<210> 101 
<211> 133 
<212> PRT 
<c2!3> Homo sapiens 



<400> 101 

Met Pro Pro Arg Asn Leu Pro Gly His Cys Thr Ser Pro Pro Ala He 
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15 10 15 

Arg Val Pro Pro Val lie Pro Leu Gly Ser Arg Glu Leu Phe Cys Ser 

20 25 30 

Lys Leu Arg Arg Ala Ala Val Phe Pro Pro Ala His Gin Gin Arg Thr 

35 40 45 

Leu Arg Pro Cys His Asn Tyr Cys Lys Gly Arg Arg Cys Leu Leu Thr 

50 55 60 

His Cys Gly Cys Asp Ala Pro His Leu Pro Arg Ala Pro Pro Pro Leu 
65 70 75 80 

Arg Thr Asp Val Gly Asp Ser Thr Pro Phe Arg Glu Lys Pro Val Ser 

85 90 95 

Ala Ala Glu Asp Ala Asn Ala Ser Glu Ser Pro Leu Thr Leu Asn His 

100 105 110 

Ser Leu Glu Met Gly Val Val Tyr Leu Thr Thr Gly Ala Val Ser Gly 

115 120 125 

Asn Ala Asn Tyr His 
130 



<210> 102 
<211> 38 
<212> PRT 

<213> Homo sapiens 



<400> 102 
Met His Arg lie Trp lie Glu Asp 

1 5 
Thr lie lie Ser Gly Leu Tyr Gly 
20 

Arg Tyr Lys Tyr Phe Pro 
35 



Thr Thr Trp Lys Phe Ser Asn Phe 

10 15 
Leu Thr Thr Phe Asn Phe Leu Leu 
25 30 



<210> 103 
<211> 1130 
<212> PRT 

<213> Homo sapiens 





<400> 103 














Met 


Ser 


Ala 


Gly 


lie 


Gly Pro Met Ala Trp Trp Pro Ser Thr 


Thr 


Gly 


1 








5 




10 




15 




Pro 


Cys 


Met 


Met 


Ser 


Thr Val Ser Thr 


Met 


Ala Lys Pro His 


Arg Glu 








20 




25 




30 






Cys 


Pro 


Gly 


Cys 


Phe 


Val Pro Phe Ala 


Val 


Cys Val Val Ser 


Arg Phe 






35 






40 




45 






Pro 


Tyr 


Tyr 


Asn 


Ser 


Leu Lys Asp Cys 


Leu 


Ser Cys Leu Leu 


Ala 


Leu 




50 








55 




60 






Leu 


Lys 


Pro 


Cys 


Lys 


Asp Phe Glu Val 


Asp 


Asn His lie Lys 


Asp 


Phe 


65 










70 




75 




80 


Ala 


Ala 


Lys 


Leu 


Ser 


Leu lie Pro Ser 


Pro 


Pro Pro Gly Pro 


Leu 


His 










85 




90 




95 




Leu 


Val 


Phe 


Asn 


Met 


Lys Ser Leu Gin 


lie 


Val Leu Pro Ala 


Arg Ala 








100 




105 




110 






Asp 


Pro 


Glu 


Ser 


Pro 


lie Leu Asp Leu Asp Leu His Leu Pro 


Leu 


Leu 






115 






120 




125 






Cys 


Phe 


Arg 


Pro 


Glu 


Lys Val Leu Gin 


lie 


Leu Thr Cys lie 


Leu 


Thr 




13 0 








135 




140 






Glu 


Gin 


Arg 


lie 


Val 


Phe Phe Ser Ser Asp Trp Ala Leu Leu 


Thr 


Leu 


145 










150 




155 




160 


Val 


Thr 


Glu 


Cys 


Phe 


Met Ala Tyr Leu Tyr 


Pro Leu Gin Trp 


Gin 


His 
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165 



Pro 


Phe 


Val 


Pro 


lie 


Leu 


Ser Asp 








180 








Pro 


Thr 


Ser 


Phe 


Leu 


Met 


Gly Cys 






195 








200 


Ser 


Lys 


Glu 


Ala 


Asp 


Gly 


Leu Val 




210 










215 


lie 


Thr 


Tyr 


Ser 


Lys 


Ser 


Thr Asp 


225 










230 




Pro 


Leu 


Leu 


Ala 


Ala 


Gin 


Thr Phe 










245 






Leu 


His 


His 


Glu 


Leu 


His 


Ala Ala 








260 








Lys 


Glu 


Gly 


Arg 


Ala 


His 


Arg Arg 






275 








280 


Gin 


He 


Gin 


Gin 


Thr 


Thr 


Leu Gin 




290 










295 


Val 


Lys 


Asn 


His 


Leu 


Asn 


Tyr Glu 


305 










310 




Phe 


Leu 


Lys 


Thr 


Arg 


Ala 


Pro Gly 










325 






Leu 


Asp 


Thr 


Tyr 


Met 


Phe 


His Ser 








340 








Arg 


Met 


Asp 


Ala 


Phe 


Ala 


Gin Met 






355 








360 


Asp 


Arg 


He 


Asn 


Gly 


Met 


Leu Leu 




370 










375 


Lys 


Arq 


Ala 


Ser 


Arq 


Lvs 


Ser Ser 


385 










390 




Met 


Val 


Val 


Ser 


Met 


Pro 


Asn Leu 










405 






Ala 


Pro 


Arg 


Asn 


Ser 


Ser 


Leu Arg 








420 








Gly 


Ser 


Ser 


Ala 


Val 


Leu 


Asn Val 






435 








440 


Lys 


lie 


Pro 


Glu 


He 


His 


Phe Pro 




450 










455 


Tyr 


His 


Ala 


His 


Phe 


Val 


Ser Met 


465 










470 




Ala 


Pro 


Asp 


Asn 


Ser 


Leu 


Leu Leu 










485 






Leu 


Val 


Tyr 


Leu 


Met 


Gin 


Gly Gin 








500 








Gin 


Asn 


Leu 


Tyr 


Lys 


Thr 


Asp He 






515 








520 


Lys 


Arg 


Thr 


Val 


Glu 


Ser 


Met Ser 




530 










535 


Gin 


Ala 


Pro 


Glu 


Leu 


Met 


Arg Leu 


545 










550 




His 


Glu 


Ala 


Ser 


Lys 


Leu 


Asp Asp 










565 






Lys 


Lys 


His 


Met 


Gin 


Leu 


Gly Asp 








580 








Gly 


He 


Val 


Lys 


Asp 


Ala 


Ser He 






595 








600 


Thr 


Val 


Gly 


Gin 


Glu 


Lys 


Gin He 




610 










615 


Tyr 


Asn 


Cys 


Trp 


Lys 


Glu 


Thr Glu 


625 










630 




Pro 


Trp 


Leu 


Val 


Met 


Glu 


His Leu 










645 






Leu 


Ser 


Ser 


Ser 


Val 


Lys 


Thr Asn 








660 








Thr 


Gin 


Lys 


Arg 


Leu 


Phe 


Leu Leu 





170 










175 




Gin 


Met 


Leu 


Asp 


Phe 


Val 


Met 


Glv 

j 


185 










190 






His 


Leu 


Asp 


His 


Phe 


Glu 


Glu 


Val 










205 








Leu 


He 


Asn 


He 


Asp 


His 


Glv 


Ser 








220 










Asp 


Asn 


Val 


Asp 


lie 


Pro 


Asp 


Val 






235 










240 


He 


Gin 


Arq 


Val 


Gin 


Ser 


Leu 


Gin 




250 










255 




His 


Leu 


Leu 


Ser 


Ser 


Thr 


Asp 


Leu 


265 










270 






Ser 


Trp 


Gin 


Gin 


Lvs 

■Jr 


Leu 


Asn 


Cys 










285 








Leu 


Leu 


Val 


Ser 


He 


Phe 


Arg 


Asp 








300 










His 


Ara 


Val 


Phe 


Asn 


Ser 


Glu 


Glu 






315 










320 


Asp 


His 


Gin 


Phe 




j_»ys 




vax 




330 










335 




Phe 


Leu 


Lys 


Ala 




Leu 


Asn 


At*ct 

-Hxy 


345 










350 






Asp 


Leu 


Asp 


Thr 


Gin 


Ser 




Glu 










365 








Ser 


Pro 








Thr 
X UI 


V d_L 


ui, LL 








380 










His 


Leu 


His 


Val 


Thr 


His 


A yrr 


A ircx 






395 










400 


Gin 


Asp 


He 


Ala 


Met 


XT J. U 


ai ii 






410 










415 




Leu 


Thr 


Asp 


Thr 


Ala 


Gly 






425 










430 






Thr 


Pro 


Lys 


Ser 


Pro 


iyr 


TVir- 

JL IxJL 












445 








Leu 


Glu 


Ser 


Lys 


Cys 


Val 


Gin 


Ala 








460 










Leu 


Ser 


Glu 


Ala 


Met 


Cys 


Phe 








475 










480 


Ala 






Leu 




Leu 




uxy 




490 










495 




Leu 


Leu 


Asn 


Ala 


Leu 


Leu 


Asp 


Phe 


505 










510 






Arq 


He 


Phe 


Pro 


Thr 


Asp 


Leu 


Val 










525 








Ala 


Pro 


Glu 


T^-p 


Glu 


Glv 


Ala 


Glu 








540 










He 


Ser 


Glu 


He 


Leu 


Asp 


Lys 


Pro 






555 










560 


His 


Val 


Lvs 


Lvs 


Phe 


Lys 


Leu 


Pro 




570 










575 




Phe 


Met 


Lys 


Arg 


Val 


Gin 


Glu 


Ser 


585 










590 






He 


His 


Arq 


Leu 


Phe 


Glu 


Ala 


Leu 










605 








ASP 


Pro 


Glu 


Thr 


Phe 


Lys 


Asp 


Phe 








620 










Ala 


Glu 


Ala 


Gin 


Glu 


Val 


Ser 


Leu 






635 










640 


Asp 


Lys 


Asn 


Glu 


Cys 


Val 


Cys 


Lys 




650 










655 




Leu 


Gly 


val 


Gly 


Lys 


He 


Ala 


Met 


665 










670 






Thr 


Glu 


Gly 


Arg 


Pro 


Gly 


Tyr 


Leu 
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675 










680 










685 








Glu 


He 


Ser 


Thr 


Phe 


Arg 


Asn 


He 


Glu 


Glu 


Val 


Ara 


Arcr 


Thr 


Thr 


Thr 




690 










695 










700 










Thr 


Phe 


Leu 


Leu 


Ara 


Arg 


He 


Pro 


Thr 


Leu 


Lys 


He 


Ara 


Val 


Ala 


Ser 


705 










710 










715 










720 


Lys 


Lys 


Glu 


Val 


Phe 


Glu 


Ala 


Asn 


Leu 


Lvs 


Thr 


Glu 


Cvs 


Asp 


Leu 


Tro 










725 










730 










735 




His 


Leu 


Met 


Val 


Lys 


Glu 


Met 


Trp 


Ala 


Glv 


Lys 


Lys 


Leu 


Ala 


Asp 


Asp 








740 










745 










750 






His 


Lys 


Asp 


Pro 


His 


Tyr 


Val 


Gin 


Gin 


Ala 


Leu 


Thr 


Asn 


Val 


Leu 


Leu 






755 










760 










765 








Met 


Asp 


Ala 


Val 


Val 


Gly 


Thr 


Leu 


Gin 


Ser 


Pro 


Glv 


Ala 


He 


Tvr 


Ala 




770 










775 










780 










Ala 


Ser 


Lys 


Leu 


Ser 


Tyr 


Phe 


Asp 


Lys 

2 


Met 


Ser 


Asn 


Glu 


Met 


Pro 


Met 


785 










790 










795 










800 


Thr 


Leu 


Pro 


Glu 


Thr 


Thr 


Leu 


GlU 


Thr 


Leu 


Lvs 


His 


Lys 


He 


Asn 


Pro 










805 










810 










815 




Ser 


Ala 


Gly 


Glu 


Ala 


Phe 


Pro 


Gin 


Ala 


Val 


Asp 


Val 


Leu 


Leu 




Thr 








820 










825 










830 






Pro 


Gly 


His 


Leu 


ASP 


Pro 


Ala 


Glu 


Lys 


Val 


Glu 


Asp 


Ala 


His 


Pro 


Lys 






835 










840 










845 








Leu 


Trp 


Cys 


Ala 


Leu 


Ser 


Glu 


Glv 


Lys 


Val 


Thr 


Val 


Phe 


Asn 


Ala 


Ser 




850 










855 










860 










Ser 


Trp 


Thr 


He 


His 


Gin 


His 


Ser 


Phe 


Lys 


Val 


Gly 


Thr 


Ala 


Lys 


Val 


865 










870 










875 










880 


Asn 


Cys 


Met 


Val 


Met 


Ala 


Asp 


Gin 


Asn 


Gin 


Val 




Val 




OCX 












885 










890 










895 




Asp 


Ser 


Val 


He 


Tvr 


He 


He 


Asn 


Val 


His 


Ser 


Met 


Ser 


Cys 


Asn 


Lys 








900 










905 










910 






Gin 


Leu 


Thr 


Ala 


His 


CVS 


Ser 


Ser 


Val 


Thr 


Asp 


Leu 


He 


Val 


Gin 


Asp 






915 










920 










925 








Glv 


Gin 


Glu 


Ala 


Pro 


Ser 


Asn 


Val 


Tvr 


Ser 


v,ys> 


Ser 


Met 


Asp 


Gly 






930 










935 










940 










Val 


Leu 


Val 


Trp 


Asn 


Val 


Ser 


Thr 


Leu 


Gin 


Val 


Thr 


Ser 


Arg 


■true 


Hi n 


945 










950 










955 










960 


Leu 


Pro 


Arg 


Glv 


Glv 
j 


Leu 


Thr 


Ser 


He 


Arg 


Leu 


His 


Gly 


Gly 




Leu 










965 










970 










975 




Trp 


Cys 


Cys 


Thr 


Gly 


Asn 


Ser 


He 


Met 


Val 


Met 


Lys 


Met 


Asn 


Gly 


Ser 








980 










985 










990 






Leu 


His 


Gin 


Glu 


Leu 


Lys 


He 


Glu 


Glu 


Asn 


Phe 


Lys 


Asp 


Thr 


Ser 


Thr 






995 








1000 








1005 








Ser 


Phe 


Leu 


Ala 


Phe 


Gin 


Leu 


Leu 


Pro 


Glu 


Glu 


Glu 


Gin 


Leu 


Trp 


Ala 


1010 








1015 








1020 










Ala 


Cys 


Ala 


Gly 


Arg 


Ser 


Glu 


Val 


Tyr 


He 


Trp 


Ser 


Leu 


Lys 


Asp 


Leu 



1025 1030 1035 1040 

Ala Gin Pro Pro Gin Arg Val Pro Leu Glu Asp Cys Ser Glu He Asn 

1045 1050 1055 

Cys Met He Arg Val Lys Lys Gin Val Trp Val Gly Ser Arg Gly Leu 

1060 1065 1070 

Gly Gin Gly Thr Pro Lys Gly Lys He Tyr Val He Asp Ala Glu Arg 

1075 1080 1085 

Lys Thr Val Glu Lys Glu Leu Val Ala His Met Asp Thr Val Arg Thr 

1090 1095 1100 

Leu Cys Ser Ala Glu Asp Arg Tyr Val Leu Ser Gly Ser Gly Arg Glu 
1105 1110 1115 1120 

Glu Gly Lys Val Ala He Trp Lys Gly Glu 
1125 1130 



<210> 104 
<211> 140 
<212> PRT 

<213> Homo sapiens 
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<400> 104 



Met Gly 


Gin 


Ala 


Gin Ser Lys 


Pro Thr 


Pro 


Leu Gly 


Thr 


Val 


Leu Lys 


1 






5 




10 








15 


Asn Phe 


Lys 


Lys 


Gly Ser Asn 


Gly Asp 


Tyr 


Gly He 


Ala 


Met 


Thr Pro 






20 




25 








30 




Gly Lys 


Leu 


Lys 


Ala Leu Cys 


Glu He 


Asp 


Trp Pro 


Ala 


Leu 


Glu Val 




35 






40 






45 






Gly Trp 


Pro 


Ser 


Glu Gly Ser 


Leu Asp 


Lys 


Ser Leu 


Val 


Ser 


Lys Val 


50 






55 






60 








Trp His 


Lys 


Val 


Thr Glu Phe 


Pro Val 


Asn 


Gin His 


Gin 


Leu 


Glu Asp 


65 






70 






75 






80 


His He 


Leu 


He 


Lys Gly Trp 


Lys Glu 


Arg 


Lys Leu 


Glu 


Pro 


Ala Trp 








85 




90 








95 


Glu Gly 


Pro 


Tyr 


Pro Val Leu 


Leu Thr 


Thr 


Lys Thr 


Ala 


Val 


Arg Thr 






100 




105 








110 




Ala Lys 


Lys 


Lys 


Lys Lys Lys 


Lys Lys 


Asp 


Gly Leu 


He 


Thr 


Pro Lys 




115 






120 






125 






Ser Arg 


Lys 


Cys 


His Pro Leu 
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Ser 


Thr 


Trp Gin 


Val 


Val 


Gly 


Ser 


705 








710 






Pro 


Ala 


Met Ala 


Gin 


Val 


Ala 


Glu 








725 








Ala 


He 


Ser Pro 


Cys 


His 


Val 


Leu 






740 










Met 


Gin 


Gin Gly 


Ser 


Ser 


Glu 


Thr 






755 








760 


Phe 


His 


He Phe 


Val 


Phe 


Thr 


Lys 




770 








775 




Arg 


Ala 


Gin Leu 


Arg 


Thr 


His 


Pro 


785 








790 






Gly 


Phe 


Phe Ser 


Cys 


Gly 


Leu 


Gly 








805 










<210> 115 












<211> 241 












<212> PRT 












<213> Homo 


sapiens 








<400> 115 










Met 


Val 


Ser Lys 


Arg 


Arg 


Leu 


Ser 


1 






5 








Thr 


Glu 


Asp Ala 


Ser 


Lys 


Thr 


Arg 






20 










Lys 


Lys 


Ser His 


lie 


Ala 


Asn 


Glu 






35 








40 


Val 


Lys 


Leu Leu 


Lys 


He 


Ser 


Gly 




50 








55 




Gin 


Asn 


Gin Leu 


Ala 


Val 


Asp 


Gin 


65 








70 






Gin 


Thr 


Leu Arg 


Arg 


His 


Pro 


Ser 








85 








Val 


Ser 


Gly Leu 


Glu 


Ser 


Tyr 


He 





490 










495 




Pro 


Ala 


Arcr 


Arcr 


Pro 


Ser 


Ser 


Ala 


505 










510 






Ala 


Ala 


Thr 


Tvr 


Ser 


Ala 


Pro 


Glv 










525 








Pro 


Ser 


Asn 


Ser 


Glv 

<* 


Tvr 


Ser 


Ser 








540 










Arcr 


Leu 


Ser 


Pro 


Thr 


Tvr 


Pro 


Leu 






555 










560 


Pro 


Gly 


Pro 


Ser 


Asn 


Pro 


Leu 


His 




570 










575 




Thr 


Ala 


Tyr 


Thr 


Val 


Pro 


Met 


Glu 


585 










590 






Ala 


Ala 


Ser 


Pro 


Leu 


Pro 


Ser 


Pro 










605 








Lys 


Pro 


Glv 


Ala 


Pro 


Leu 


Glv 


Pro 








620 










Ser 


Leu 


His 


Thr 


Pro 


His 


Ser 


Pro 






635 










640 


Glv 


Ala 


Pro 


He 


He 


Val 


Ser 


Thr 




650 










655 




Phe 


Ala 


Glv 


Gin 


Ala 


Val 


Gin 


Ser 


665 










670 






Ala 


Glv 


Thr 


Pro 


Val 


Arg 


Pro 


Thr 










685 








Glv 


Leu 


Gin 


Ala 


Arg 


v ct J. 


oer 










700 










Ala 


Val 


Ala 


Leu 


Airg 


ijeu 


inr 








715 










720 


Pro 


Ser 


Gly 


Gly 


Gly 


Cys 


Glu 






730 










735 




Ser 


Pro 


Glu 


<£rJL \J 




lieu 


nib 




745 










750 






Thr 


Asn 


Glu 


Trp 


Gly 


Cys 


Gly 


His 










765 








Tyr 


Ser 


Gin 


Ala 


Cys 


Ser 


Leu 


His 








780 










Val 


Thr 


Arg 


Ala 


Gly 


His 


Ser 


His 






795 










800 


Phe 


Gin 


Gin 


Leu 


Glu 


Val 








810 














Lys 


Ser 


Glu 


Asp 


Lys 


Glu 


Ser 


Leu 




10 










15 




Lys 


Gin 


Pro 


Leu 


Ser 


Lys 


Lys 


Thr 


25 










30 






Val 


Glu 


Glu 


Asn 


Asp 


Ser 


He 


Phe 










45 








He 


He 


Leu 


Lys 


Thr 


Gly 


Glu 


Ser 








60 










He 


Ala 


Phe 


Gin 


Lys 


Lys 


Leu 


Phe 






75 










80 


Tyr 


Pro 


Thr 


He 


He 


Glu 


Glu 


Phe 




90 










95 




Glu 


Asp 


Glu 


Asp 


Ser 


Leu 


Arg 


Asn 
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100 










105 


Cys 


Leu 


Leu 


Ser 


Cvs 


Glu 


Arcr 


Leu 


Gin 






115 










120 




Ala 


Ser 


Tyr 


Ser 


Lvs 

2 


Ser 


Leu 


He 


Lvs 




130 










135 






Leu 


Gin 


Pro 


Ala 


He 


He 


Lys 


Thr 


Leu 


145 










150 








Phe 


Phe 


Glu 


Asn 


Lys 


Asn 


Ser 


ASP 


Glu 










165 










Val 


Ser 


Gin 


Leu 


Lvs 


Trp 


Leu 


Asp 


Arcr 








180 










185 


Thr 


Thr 


Lys 


He 


Met 


Gin 


Leu 


He 


Ser 






195 










200 




His 


Asp 


He 


He 


Thr 


Ser 


Leu 


Pro 


Glu 




210 










215 






Ala 


Asp 


Val 


Gly 


LVS 


Glu 


Leu 


Arcr 


Tro 


225 










230 








Lys 




















<210> 116 














<211> 396 














<212> PRT 














<213> Homo 


sapiens 










<400> 116 












Met 


Val 


Glu 


Arg 


Arg 


Pro 




Leu 


Asp 


l 








5 










Thr 


Asn 


His 


Arg 


Gly 


Pro 


Val 


Asp 


Glv 








20 










25 


Asn 


Gin 


Ala 


Asn 


Asn 


Pro 


Pro 


Ala 


Asn 






35 










40 




His 


Pro 


Gly 


Ser 


Asp 


Pro 


Arcr 


Ala 


Asn 




50 










55 






Gin 


Arg 


Glu 


Glu 


Arg 


Phe 


Arg 


Ala 


Met 


65 










70 








Ser 


Glu 


Glu 


Gin 


Glu 


Trp 


His 


Val 


Cys 










85 










His 


Cys 


Gin 


Gly 


Asn 


Asp 


Thr 


Arcr 










100 










105 


Arg 


Asn 


Arg 


Arg 


Pro 


Pro 


Tro 


Ser 


Asn 






115 










120 




Thr 


Pro 


His 


Gin 


Lys 


Pro 


Thr 


Glu 


Gin 




130 










135 






Gly 


Tyr 


Lys 


Phe 


Leu 


Glu 


Ser 


Leu 


Leu 


145 










150 








Val 


lie 


Thr 


Leu 


Ala 


Thr 


Ser 


Leu 


Glv 










165 










Ser 


Ser 


Met 


Lys 


Ser 


Asn 


Phe 


Leu 


Glu 








180 










185 


Lys 


Ala 


Cys 


Ser 


Ser 


Lys 


Met 


Asp 


Arcr 






195 










200 




Gly 


He 


Leu 


Lys 


Asn 


Ser 


Lys 


Phe 


Leu 




210 










215 






Val 


Val 


Gly 


Met 


He 


Thr 


Glu 


Pro 


He 


225 










230 








Pro 


Glu 


His 


He 


Ser 


Asn 


He 


He 


Ser 










245 










Val 


Phe 


Pro 


Ala 


Ser 


Ser 


Val 


Gin 


Glu 








260 










265 


Leu 


Pro 


Thr 


Ser 


Leu 


Asn 


Ala 


Leu 


Arg 
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110 



Asp 


Glu 


Glu 


Ala 


Ser 


Met 


Gly 








125 








Leu 


Leu 


Leu 


Gly 


He 


Asp 


He 






140 










Phe 


Glu 


Lys 


Leu 


Pro 


Glu 


Tyr 




155 










160 


He 


Asn 


He 


Pro 


Arg 


Leu 


He 


170 










175 




Val 


Val 


Asp 


Gly 


Lys 


Asp 


Leu 










190 






He 


Ala 


Pro 


Glu 


Asn 


Leu 


Gin 








205 








He 


Leu 


Gly 


Asp 


Ser 


Gin 


His 






220 










He 


Asn 


Pro 


Leu 


Ser 


Ser 


Ser 




235 










240 


Ala 


Arg 


Pro 


Arcr 


Asn 


Ser 


His 


10 










15 




Glu 


Leu 


Pro 


Pro 


Arg 


Ala 


Arg 










30 






Ala 


Leu 


Arg 


Gly 


Gly 


Ala 


Ser 








45 








Asn 


His 


Pro 


Ala 


Ala 


Tyr 


Cys 






60 










Gly 


Trp 


Asn 


Pro 


His 


Gin 


Gly 




75 










80 


Asp 


Glu 


Ala 


Arcr 


Asp 


Gin 


Arcr 


90 










95 




Arg 


Asn 


Gly 


Asn 


Gin 


Asp 


Cvs 










110 






Asp 


Asn 


Phe 


Gin 


Gin 


Trp 


Arg 








125 








Pro 


Gin 


Gin 


Ala 


Lys 


Lys 


Leu 






140 










Gin 


Lys 


Asp 


Pro 


Ser 


Glu 


Val 




155 










160 


Leu 


Lys 


Glu 


Leu 


Leu 


Ser 


His 


170 










175 




Leu 


He 


Cys 


Gin 


Val 


Leu 


Arg 










190 






Gin 


Ser 


Val 


Leu 


His 


Val 


Leu 








205 








Lys 


Val 


Cys 


Leu 


Pro 


Ala 


Tyr 






220 










Pro 


Asp 


He 


Arg 


Asn 


Gin 


Tyr 




235 










240 


Leu 


Leu 


Gin 


Asp 


Leu 


Val 


Ser 


250 










255 




Thr 


Ser 


Met 


Leu 


Val 


Ser 


Leu 










270 






Ala 


Ser 


Gly 


Val 


Asp 


He 


Glu 
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275 280 285 

Glu Glu Thr Glu Lys Asn Leu Glu Lys Val Gin Thr He He Glu His 

290 295 300 

Leu Gin Glu Lys Arg Arg Glu Gly Thr Leu Arg Val Asp Thr Tyr Thr 
305 310 315 320 

Leu Val Gin Ala Glu Glu Arg Gly Arg Met Leu Arg Ala Thr Leu Thr 

325 330 335 

Met Pro Arg Tyr Pro Thr Tyr Thr Glu Ala His Leu Gly Glu Glu Ala 

340 345 350 

Leu Pro Ser Pro Gin. Tyr His Phe Trp Lys He Arg Gin His Cys Tyr 

355 360 365 

Leu Ser Gly Tyr Pro Leu Pro Ala Ser Gly Arg Arg Phe Arg Gin Thr 

370 375 380 

Phe Thr Gly Arg Tyr Phe Gly Thr Ser Pro Lys Leu 
385 390 395 



<210> 117 
<211> 153 
<212> PRT 

<213> Homo sapiens 



<400> 117 

Met Gly Trp Leu Phe Leu Lys Val Leu Val Ala Gly Glu Ser Phe Ser 

15 10 15 

Gly Leu Leu Tyr Pro Leu Val Asp Phe Cys He Ser Gly Lys Thr Arg 

20 25 30 

Gly Gin Lys Pro Asn Phe Val He He Leu Ala Asp Asp Met Gly Trp 

35 40 45 

Gly Asp Leu Gly Ala Asn Trp Ala Glu Thr Lys Asp Thr Ala Asn Leu 

50 55 60 

Asp Asn Met Ala Ser Glu Gly Met Arg Phe Val Asp Phe His Ala Ala 
65 70 75 80 

Ala Ser Thr Cys Ser Pro Ser Arg Ala Ser Leu Leu Thr Gly Arg Leu 

85 90 95 

Gly Leu Arg Asn Gly Val Thr Arg Asn Phe Ala Val Thr Ser Val Gly 

100 105 110 

Gly Leu Pro Leu Asn Glu Thr Thr Leu Ala Glu Val Leu Gin Gin Ala 

115 120 125 

Gly Tyr Val Thr Gly He He Gly Lys Trp His Leu Gly His His Gly 

130 135 140 

Ser Tyr Gin Pro Arg Val Pro Trp Ser 
145 150 



<210> 118 
<211> 169 
<212> PRT 

<213> Homo sapiens 





<400> 118 








Met 


Pro Pro Asn 


Asn Cys 


Tyr 


Val 


1 




5 






Val 


Glu Asp Val 


Ala Arg 


Val 


Asp 




20 








Ala 


His Asp Thr 


He Gin 


Asp 


Asp 




35 






40 


Asn 


Glu Lys Glu 


Ala Trp 


Tyr 


Arg 




50 




55 




Asp 


Leu Ser Gin 


Leu Arg 


Tyr 


Glu 



Asn 


Phe Glu Arg 


Phe 


Ala 


Cys Val 




10 






15 


Leu 


Gly Cys Arg 


Ala 


Leu 


Val Glu 


25 






30 




Val 


Glu Ala Leu 


Val 


Ser 


He Phe 






45 






Glu 


Glu Asn Glu 


Ser 


Ala 


Arg His 




60 








Phe 


Arg Lys Val 


Glu 


Ser 


Leu Lys 
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65 70 75 80 

Lys Leu Leu Arg Glu Asp lie Gin Ala Thr Gly Cys Ser Leu Gly Ser 

85 90 95 

Met Ala Arg Lys Leu Asp His Leu Gin Ala Gin Phe Glu lie Leu Arg 

100 105 no 

Gin Glu Leu Ser Ala Asp Leu Gin Trp lie Gin Glu Leu Val Gly Ser 

115 120 125 

Phe Gin Leu Glu Ser Gly Ser Ser Glu Gly Leu Gly Ser Thr Phe Tyr 

130 135 140 

Gin Asp Thr Ser Glu Ser Leu Ser Glu Leu Leu Ser Arg Ser Cys Thr 
145 150 155 160 

Glu Glu Phe Leu Ala Gly Trp Lys Leu 
165 



<210> 119 

<211> 51 

<212> PRT 

<213> Homo sapiens 



<400> 119 

Met Val Pro Val Phe Ser Val Glu Lys Asp Gly Glu Glu Leu Gly Ser 

15 10 15 

Phe Arg Pro Arg Trp Ala Asp Trp Leu Thr Gly Leu Leu Glu Trp Val 

20 25 30 

Ser Val Glu Ser Leu Ser lie Tyr Cys lie Ser Gin Pro Val Tyr Met 

35 40 45 

Trp Val Glu 
50 



<210> 120 
<211> 169 
<212> PRT 
<213> Homo sapiens 



<400> 120 

Met Gly Ser Lys Gly Arg Ser Leu Asp Ala Arg Ser Arg Gly Gly Arg 

1 5 10 15 

Thr Ser Met Arg Lys Pro Leu Ala Glu Asn Gly Arg Ser Ser Ala Ala 

20 25 30 

Ser Gin Pro Gin Leu Pro Gly Arg Cys Ser Arg Asp He Gly Gly Val 

35 40 45 

Asn He Gin Lys Cys Asp Cys Leu Thr Gin Pro Arg Ala Leu Ala He 

50 55 60 

He Lys Arg Cys Ser Asp Gly Ala Val Gin Glu Cys Asp Ala Gly Glu 
65 70 75 80 

Leu Glu Gin Gin Ser His He Ser Thr Ser Arg Pro Thr Ala Val Ser 

85 90 95 

His Thr Leu Glu Pro Ser Phe Ala Gin Ser His Met His Asp Trp Asp 

100 105 110 

Arg Gly Phe Arg Pro Leu Pro Thr Pro His Ala Gly Ser Val Pro Asp 

115 120 125 

Ala Gin Val Pro His Trp Gly Ala Leu Ser Arg Leu Leu His Thr Leu 

130 135 140 

Arg Ser Cys Pro Pro Gin Glu Arg Leu Arg Ala Ser Ser Val Lys Trp 
145 150 155 160 

Asn Ser Lys Gin Pro Pro Gly His Ser 
165 
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<210> 121 
<211> 428 
<212> PRT 

<213> Homo sapiens 



<400> 121 
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Ser 


Val 


Asp 


Tvr 


Gin 


Ala 


Ser 
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Tvr 
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Lvs 
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Asn 


Phe 


Val 








20 










Asp 
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Lvs 


Glu 


Glv 


Ala 


Phe 


Ala 






35 










40 


Asp 
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Glu 


Ala 




Gly 


Phe 


Val 




50 










55 




Ser 


Ala 


Asp 


Gin 


His 


Lys 


He 


Met 


65 
















Glu 


Phe 


Ser 




Ala 


Ser 


Glu 


Cys 










o o 








Phe 


Ala 




Glu 




Tl 


Lys 


Lys 
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Vet J. 
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Tri-r 

XXIX 


Olll 


1 ill. 


VjJL U 
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Gly 
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Ser 
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Al 5* 


lie 














JL J 3 




Val 


Glu 


Lys 


Ser 




Tip 


VJX u 


nib 


X*± J 










JL O U 






Ser 


1 Y X 


x-*x v_J 


His 


Arg 


Lys 


Leu 


Lys 










165 








Met 


Glu 


His 


He 


Gin 


Asp 


Gin 


ill ct 








180 










Pro 


Asp 


Glu 


Ser 


Ala 


Asp 


Thr 


Asp 






195 










200 


Thr 


Pro 


Lvs 


Leu 


He 


Lvs 


Ala 


Lys 




210 










215 




Leu 


Lvs 


Thr 


Leu 


Glu 


Glu 


Leu 


Cys 


225 










230 






Leu 


Ala 


Gin 


Leu 


Ser 


His 


He 


Glu 










245 








Tyr 


Leu 


Leu 


Thr 


Ser 


Gin 


He 


Asp 








260 










lie 


Thr 


Asn 


Phe 


Leu 


Phe 


Glu 


Asp 






275 










280 


Val 


Glu 


Lys 


Gin 


Glu 


Ser 


He 


Pro 




290 










295 




Pro 


Ser 


Ser 


Ser 


Leu 


Glu 


Glu 


Cys 


305 










310 






Val 


Gly 


Ser 


Tyr 


Lys 


Ser 


Ser 


Val 










325 








Gin 


Glu 


Leu 


Gly 


Asp 


Glu 


Glu 


Tyr 








340 










Ser 


Ser 


Phe 


Asp 


Pro 


Gin 


Glu 


Asn 






355 










360 


Lys 


Gly 


Ser 


He 


Ser 


Ser 


Gly 


Glu 




370 










375 




Lys 


Ser 


Thr 


Ser 


Val 


Leu 


Ser 


Lys 


385 










390 






Val 


Pro 


Leu 


Ser 


Pro 


Gin 


Val 


Val 










405 








Thr 


He 


Ser 


Glu 


Asp 


Ser 


He 


Glu 



420 



Phe 


Val 


Glv 


His 


Pro 


Pro 


Glv 


Ser 




10 










15 




Glu 


Asp 


Ser 


Lvs 


Val 


Val 


Leu 


Glv 


25 










30 






Tvr 


Val 


His 


His 


Leu 


Thr 


Leu 


Tvr 










45 








Arcr 


Pro 


Phe 


Cys 


Met 


Ala 


Tyr 


lie 








60 










Gin 


Gin 


Phe 


Gin 


Glu 


Leu 


Ser 


Ala 






75 










an 


Leu 


Lys 


Thr 


Glv 


Asn 




Lys 


AXa 




on 














Leu 


Lys 




L6U 


ASp 


Tyr 


X ixx 


Arg 


inc 










XJLU 






He 


Gin 


Lys 


xiy s 


Ala 


Asn 


Asp 


Lys 










IOC 








Glu 


Lys 


2X1 a 
nld 


Asn 


uxu 


Leu 




Ser 








i a n 
x*tU 










Gin 


Asp 


Leu 


Leu 


Lys 


VjXIl 


XX ti 


Arg 






Xjj 










J.OU 


Gly 


His 


Asp 


Leu 


Cys 


Pro 




IvXIX 




JL / \J 










X /b 




Ser 


Gin 


Ala 


C £} >- 

OCX. 


Thr 
xxxx 


Thr* 
xxxx 


Ser 


Asn 


185 










X 2? v 






Leu 


xyx 


Thr 


Cys 




— X. 


J-VXd 


Tyr 










205 








Ser 


Thr 


Lys 


v-yt> 


Phe 


Asp 


Lys 


Ti\7C 

XJ_y o 








220 










Asp 


Thr 


Glu 


Tvr 


Phe 


Thr 


Gin 


Thr 






235 










240 


His 


Met 


Phe 


Arcr 


Glv 


Asp 


Leu 


Cys 




250 










255 




Arcr 


Ala 


Leu 


Leu 


LVS 


Gin 


Gin 


His 


265 










270 






Phe 


Val 


Glu 


Val 


Asp 


Asp 


Arcr 


Met 










285 








Ser 


Lvs 

**- J Jr *-* 


Pro 


Ser 


Gin 


Asp 


Ajtct 
-rtxy 


Pro 








300 










Pro 


He 


Pro 


Lys 


Val 


Leu 


He 


Ser 






315 










320 


Glu 


Ser 


Val 


Leu 


He 


Lys 


Met 


Glu 




330 










335 




Lys 


Glu 


Val 


Glu 


Val 


Thr 


Glu 


Leu 


345 










350 






Leu 


Asp 


Tyr 


Leu 


Asp 


Met 


Asp 


Met 










365 








Ser 


He 


Glu 


Val 


Leu 


Gly 


Thr 


Glu 








380 










Ser 


Asp 


Ser 


Gin 


Ala 


Ser 


Leu 


Thr 






395 










400 


Arg 


Ser 


Lys 


Ala 


Val 


Ser 


His 


Arg 




410 










415 




Val 


Leu 


Ser 


Thr 











425 
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<210> 122 
<211> 168 
<212> PRT 

<213> Homo sapiens 



<400> 122 



Met 


Gly 


Glu 


Glu 


Ala 


Val 


Arg 


Trp 


Ala 


Lys 


Leu 


Val 


He 


Pro 


Leu 


Val 


1 








5 










10 










15 




Val 


His 


Ser 


Ala 
20 


Gin 


Lys 


Val 


His 


Leu 
25 


Arg 


Gly 


Ala 


Thr 


Ala 
30 


Leu 


Glu 


Met 


Gly 


Met 
35 


Pro 


Leu 


Leu 


Leu 


Gin 
40 


Lys 


Gin 


Gin 


Glu 


He 
45 


Ala 


Ser 


He 


Thr 


Glu 

50 


Gin 


Leu 


Met 


Thr 


Thr 
55 


Thr 


Leu 


His 


Arg 


Ser 
60 


Gly 


Ser 


Phe 


He 


Asn 


Ser 


Leu 


Leu 


Gin 


Leu 


Glu 


Glu 


Leu 


Gly 


Phe 


Arg 


Ser 


Gly 


Ala 


Pro 


65 










70 










75 










80 


Met 


He 


Lys 


Lys 


lie 
85 


Ala 


Phe 


He 


Ala 


Trp 
90 


Lys 


Ser 


Leu 


He 


Asp 
95 


Asn 


Phe 


Ala 


Leu 


Asn 
100 


Pro 


Asp 


He 


Leu 


Cys 
105 


Ser 


Ala 


Lys 


Arg 


Leu 
110 


Lys- 


Leu 


Leu 


Met 


Gin 
115 


Pro 


Leu 


Ser 


Ser 


He 
120 


His 


Val 


Arg 


Thr 


Glu 
125 


Thr 


Leu 


Ala 


Leu 


Thr 


Lys 


Leu 


Glu 


Val 


Trp 


Trp 


Tyr 


Leu 


Leu 


Met 


Arg 


Leu 


Gly Pro 




130 










135 










140 










His 


Leu 


Pro 


Ala 


Asn 


Phe 


Glu 


Gin 


Gly 


Cys 


Val 


Pro 


Leu 


He 


Gin 


Ser 



145 150 
????????????????????????????????????????????????????????????????????????????? 
??????????????????????????????????????????????????? 55 
60 

Ser Leu Asp Phe Lys Phe Gly Arg Arg Ser Asn Arg Ala Asp Glu Leu 
65 

????????????????????????????????????????????????????????????????????????????? 
??????????????????????????????????????????????????? 85 
90 95 



Ala 


Thr 


Ala 


Gly 


Asn 


Arg 


Gly 


Leu 


Ala 


Phe 


Pro 


Ser 


Arg 


His 


He 


Asm 


ksp Thr Asp 
































35 










40 










45 








Ser 


Glu 
50 


Thr 


Ser 


Ser 


Pro 


Ala 
55 


Pro 


Ser 


Pro 


Val 


Gin 
60 


Pro 


Pro 


Phe 


Phe 


Ser 


Glu 


Cys 


Ser 


Leu 


Gly 


Tyr 


Phe 


Ser 


Pro 


Ala 
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Ser 


Glu Phe Ala 
25 


Ser 


Val 


Cys 


Thr 


Ser 
30 


Asn 


Ser 


He 


Leu 


Asp 
35 


Leu 


Phe 


Arg 


Thr Pro Ala 
40 


He 


Arg 


Lys 


Val 
45 


Thr 


Cys 


Cys 


Leu 


Met 
50 


Val 


He 


Trp 


Arg 


Met Ala Pro 
55 


Pro 


Ala 


Gly 
60 


Arg 


Glu 


Leu 


Arg 


He 


Ala 


Ala 


Glu 


Ser 


Leu 


Ser Gin Gin 


Lys 


Arg 


Ala 


Phe 


Ala 


Val 


Ser 


65 










70 






75 










80 


Arg 


Arg 


He 


Gin 


Glu 
85 


Arg 


Thr Phe Ser 


Ser 
90 


Gly 


He 


Leu 


Asn 


Ser 
95 


Gly 


Ser 


Val 


Ser 


Pro 
100 


Ser 


Arg 


Lys Glu Glu 
105 


Gly 


Gly 


Arg 


Arg 


Ala 
110 


Ser 


Pro 
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Gly Arg Gin Gly Leu Pro Gin Glu Asp Ala His Ser Trp Thr Arg Val 

115 120 125 

Arg Arg Ser Pro Leu Ala Pro Gin Ser Arg Asn Cys Ala Ala Cys His 

130 135 140 

Ala Arg Leu Thr Pro Arg Lys Ser Arg Ala Thr 
145 150 155 



<210> 131 
<211> 145 
<212> PRT 

<213> Homo sapiens 



<400> 131 



Met 


Leu 


Asp 


Gin Cys Arg 


Thr 


Leu 


Ala 


Ser 


Arg 


Gly 


Thr 


Pro 


Pro Cys 


1 








5 










10 










15 


Lys 


Pro 


Ser 


Cys Val 


Ser 


Gin 


Leu 


Gly 


Gin 


Arg 


Ala 


Glu 


Pro 


Lys Ala 








20 










25 










30 




Thr 


Glu 


Arg 


Gly He Leu 


Arg 


Ala 


Thr 


Cys 


Val 


Ala 


Trp 


Glu 


Ser Gin 






35 










40 










45 






Leu 


Lys 
50 


Pro 


Glu 


Glu 


Leu 


Pro 
55 


Ser 


Met 


Gin 


Asp 


Leu 
60 


Leu 


Glu 


Glu Ala 


Ser 


Ser 


Arg 


Asp 


Met 


Gin 


Met 


Gly 


Pro 


Gly 


Leu 


Phe 


Leu 


Arg 


Met Gin 


65 










70 










75 








80 


Leu 


Val 


Pro 


Ser 


He 
85 


Glu 


Glu 


Arg 


Glu 


Thr 
90 


Pro 


Leu 


Thr 


Arg 


Glu Asp 
95 


Arg 


Pro 


Ala 


Leu 
100 


Gin 


Glu 


Pro 


Pro 


Trp 
105 


Ser 


Leu 


Gly 


Cys 


Thr 
110 


Gly Leu 


Lys 


Ala 


Ala 
115 


Met 


Gin 


He 


Gin 


Arg 
120 


Val 


Val 


He 


Pro 


Val 
125 


Pro 


Thr Leu 


Gly 


His 
130 


Arg 


Asn 


Pro 


Trp 


Val 
135 


Ala 


Arg 


Asp 


Ser 


Gly 
140 


Ala 


He 


Gly Asn 



Gly 
145 



<210> 132 
<211> 288 
<212> PRT 
<213> Homo sapiens 



<400> 132 

Met Asp Ala Ala Val Thr Asp Asp Phe Gin Gin He Leu Pro He Glu 

15 10 15 

Gin Leu Arg Ser Thr His Ala Ser Asn Asp Tyr Val Glu Arg Pro Pro 

20 25 30 

Ala Pro Cys Lys Gin Ala Leu Ser Ser Pro Ser Leu He Val Gin Thr 

35 40 45 

His Lys Ser Asp Trp Ser Leu Ala Thr Met Pro Thr Ser Leu Pro Arg 

50 55 60 

Ser Leu Ser Gin Cys His Gin Leu Gin Pro Leu Pro Gin His Leu Ser 
65 70 75 80 

Gin Ser Ser He Ala Ser Ser Met Ser His Ser Thr Thr Ala Ser Asp 

85 90 95 

Gin Arg Leu Leu Ala Ser He Thr Pro Ser Pro Ser Gly Gin Ser He 

100 105 110 

He Arg Thr Gin Pro Gly Ala Gly Val His Pro Lys Ala Asp Gly Ala 

115 120 125 

Leu Lys Gly Glu Ala Glu Gin Ser Ala Gly His Pro Ser Glu His Leu 
130 135 140 
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Phe 


He 


Cys 


Glu 


Glu Cys Gly Arg 


Cys 


145 










150 




Ala 


Arg 


Pro 


Leu 


Pro 


Ser Cys Trp 


Leu 










165 






Ser 


Ala 


Glu 


Ser 


Leu Leu Asp Tyr 


Gly 








180 






185 


Gly 


Leu 


Phe 


Tyr 


His 


Cys Ser Thr 


Asp 






195 






200 




Glu 


Pro 


Cys 


Ser 


Cys 


Gly Pro Ser 


Ser 




210 








215 




Met 


Ser 


Leu 


He 


Ser 


Leu Phe Leu 


Pro 


225 










230 




Thr 


Arg 


Gly 


Cys 


Leu 


His Leu Cys 


Gin 










245 






Arg 


Pro 


Gly 


Cys 


Arg 


Cys Lys Arg 


His 








260 






265 


He 


Ser 


Ser 


Gly 


Ser 


Ala Pro Phe 


Pro 






275 






280 






<210> 133 










<211> 255 










<212> PRT 










<213> Homo 


sapiens 






<400> 133 








Met 


Glu 


Asn 


Glu 


Lys 


Glu Asn Leu 


Phe 


1 








5 






Leu 


Met 


Lys 


Thr 


Pro 


Leu Lys Glu 


Ser 








20 






25 


Ala 


Glu 


He 


Gin 


Pro 


Asp Phe Gly 


Pro 






35 






40 




Lys 


Glu 


Gly 


Ser 


Gin 


Gly Glu Pro 


Trp 




50 








55 




Met 


Leu 


He 


Ser 


Ala 


Val Ser Pro 


Glu 


65 










70 




Arg 


Gly 


Leu 


Phe 


Asp 


Asn Arg Ser 


Gly 










85 






He 


His 


Glu 


His 


Leu 


Ser Gly Asp 


Glu 








100 






105 


Arg 


Lys 


Glu 


Lys 


Ser 


Leu Gly Leu 


Leu 






115 






120 




Tyr 


Pro 


Asn 


Tyr 


Pro 


Asn Pro Ala 


Val 




130 








135 




Glu 


Val 


Ala 


Glu 


Glu 


Leu Asn Val 


Glu 


145 










150 




Val 


Asn 


Val 


Leu 


Glu 


Ser Leu His 


Met 










165 






Arg 


Tyr 


Thr 


Trp 


His 


Gly Arg His 


Asn 








180 






185 


Leu 


Lys 


Ser 


He 


Gly 


Glu Glu Asn 


Lys 






195 






200 




He 


Lys 


Lys 


Lys 


Glu 


Tyr Glu Gin 


Glu 




210 








215 




Ser 


He 


Glu 


Asp 


His 


He He Lys 


Ser 


225 










230 




Pro' 


Asp 


Met 


Cys 


Phe 


Val Glu Leu 


Pro 



245 
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Lys 


Cys 


Val 


Pro 


Cys 


Thr 


Ala 




155 










160 


Cys 


Asn 


Gin 


Arg 


Cys 


Leu 


Cys 


170 










175 




Thr 


Cys 


Leu 


Cys 


Cys 


Val 


Lys 










190 






Asp 


Glu 


Asp 


Asn 


Cys 


Ala 


Asp 








205 








Cys 


Phe 


Val 


Arg 


Trp 


Ala 


Ala 






220 










Cys 


Leu 


Cys 


Cys 


Tyr 


Leu 


Pro 




235 










240 


Gin 


Gly 


Tyr 


Asp 


Ser 


Leu 


Arg 


250 










255 




Thr 


Asn 


Thr 


Val 


Cys 


Arg 


Lys 










270 






Lys 


Ala 


Gin 


Glu 


Lys 


Ser 


Val 








285 








Cys 


Glu 


Pro 


His 


Lys 


Arcr 


Gly 


10 










15 




Thr 


Thr 


Ala 


Asn 


He 


Val 


Leu 










30 






Leu 


Thr 


Thr 


Pro 


Thr 


Lvs 


Pro 








45 








Thr 


Pro 


Thr 


Ala 


Asn 


Leu 


Lys 






60 










He 


Arg 


Asn 


Arg 


Asp 


Gin 


Lys 




75 










80 


Leu 


Pro 


Glu 


Ala 


Lys 


Asp 


Cys 


90 










95 




Phe 


Glu 


Lys 


Ser 


Gin 


Pro 


Ser 










110 






Cys 


His 


Lys 


Phe 


Leu 


Ala 


Arg 








125 








Asn 


Asn 


Asp 


He 


Cys 


Leu 


Asp 






140 










Arg 


Arg 


Arg 


lie 


Tyr 


Asp 


He 




155 










160 


Val 


Ser 


Arg 


Leu 


Ala 


Lys 


Asn 


170 










175 




Leu 


Asn 


Lys 


Thr 


Leu 


Gly 


Thr 










190 






Tyr 


Ala 


Glu 


Gin 


He 


Met 


Met 








205 








Phe 


Asp 


Phe 


He 


Lys 


Ser 


Tyr 






220 










Asn 


Thr 


Gly 


Pro 


Asn 


Gly 


His 




235 










240 


Gly 


Val 


Glu 


Phe 


Arg 


Ala 





250 255 
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<210> 134 
<211> 68 
<212> PRT 

<213> Homo sapiens 



<400> 134 

Met Gly Thr Arg Pro Leu His Gly His Lys Leu Ser Arg Trp Asn lie 

1 5 10 15 

Met Ser Ser Asn Phe Ser Ser Thr Gly Leu Lys Asp Phe Asn Thr Pro 

20 25 30 

Lys Ala Leu Phe lie Arg Lys Gly Leu Gly Arg Gly Glu Lys Thr Leu 

35 40 45 

Val His Phe lie Gin Val Phe Ser Tyr Thr Arg Lys Gin Ser Ser His 

50 55 60 

Ser His Glu Asn 
65 



<210> 135 
<211> 211 
<212> PRT 
<213> Homo sapiens 



<400> 135 

Met Tyr Asp lie Phe Asn Leu Asn Asp Lys Ala Leu Cys Phe Thr Lys 

15 10 15 

Cys Arg Gin Ser Gly Ser Asp Ser Cys Asn Val Glu Asn Leu Gin Arg 

20 25 30 

Tyr Trp Leu Asn Tyr Glu Ala His Leu Met Lys Glu Gly Leu Thr Gin 

35 40 45 

Lys Val Asn Thr Pro Phe Leu Lys Ala Leu Val Gin Asn Leu Ser Thr 

50 55 60 

Asn Thr Ala Glu Asp Phe Tyr Leu Ser Leu Glu Pro Ser Gin Val Pro 
65 70 75 80 

Arg His Val Met Lys Asp Glu Asp Lys Pro Pro Asp Arg Val Arg Leu 

85 90 95 

Pro Lys Ser Leu Phe Arg Ser Leu Pro Gly Asn Arg Ser Val Val Arg 

100 105 110 

Leu Ala Val Thr lie Leu Asp lie Gly Pro Gly Thr Leu Phe Lys Gly 

115 120 125 

Pro Arg Leu Gly Leu Gly Asp Gly Ser Gly Val Leu Asn Asn Arg Leu 

130 135 140 

Val Gly Leu Ser Val Gly Gin Met His Val Thr Lys Leu Ala Glu Pro 
145 150 155 160 

Leu Glu lie Val Phe Ser His Gin Arg Pro Pro Pro Lys Pro Thr Met 

165 170 175 

His Ser Glu lie Thr Ser Ser He Phe Asn Arg He Ser Met Thr Cys 

180 185 190 

Thr Thr Tyr Tyr Tyr Ser Ser Thr Arg Tyr Lys Tyr Asn Asn He He 

195 200 205 

His Lys Asp 
210 



<210> 136 
<211> 147 
<212> PRT 
<213> Homo sapiens 
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<400> 136 



Met 


Ser 


Cys 


Met 


Cys 


1 








5 


Gin 


Ala 


Leu 


Leu 


Leu 








20 




Glu 


Ser 


He 


Gin 


Ala 






35 






Ser 


Leu 


Trp 


Thr 


Arg 




50 








Gin 


Gly 


Arg 


Gly 


Thr 


65 










Leu 


Cys 


Glu 


Gly 


Leu 










85 


Ser 


Gly 


Glu 


Val 


lie 








100 




Pro 


Ala 


Ala 


Leu 


Pro 






115 






Pro 


Pro 


Leu 


Pro 


Pro 




130 








Val 


Val 


Pro 







145 



Trp Pro Asn Met Leu Asn 
10 

Leu Lys Leu Leu He Met 
25 

Gly Arg Arg Gin Val Leu 
40 

Tyr Leu Ala Glu Leu Lys 
55 

Gin Leu Glu Asn Val Ala 
70 75 
Phe Asp Pro Tyr Gin Thr 
90 

Ser Ser Lys Glu Lys Ser 
105 

Gin Glu Phe Ser Ala Phe 
120 

Leu Gly Ser Thr Pro Lys 
135 



His Gly Glu 


Leu 


Glu 




15 




Leu Cys Thr 


Asn 


Leu 


30 






Glu His Arg 


Val 


Leu 


45 






Gly Cys Pro 


Pro 


Pro 


60 






Leu His Ala 


Leu 


Leu 






80 


Trp Arg Arg 


Gin Arg 




95 




Lys Tyr Lys 


Phe 


Pro 


110 






Phe Gin Gly 


Gin 


Ala 


125 






Pro Arg Pro 


Leu 


Pro 


140 







<210> 137 

<2ll> 36 

<212> PRT 

<213> Homo sapiens 



<400> 137 
Met Ser Gly Arg Val Phe Arg Cys 

1 5 
Leu Ser Glu Leu Phe Thr Glu Ala 
20 

Glu Gly Gin Lye 
35 



Gin Ala Leu Val Ala Tyr Thr Val 

10 15 
Lys Glu Gin Arg Leu Ala Thr Asp 
25 30 



<210> 138 

<211> 41 

<212> PRT 

<213> Homo sapiens 



<400> 138 
Met Ser Gly Arg Val Phe Arg Cys 

1 5 
Leu Ser Glu Leu Phe Thr Glu Ala 
20 

Glu Gly Gin Lys Glu Phe Ser Ala 
35 40 



Gin Ala Leu Val Ala Tyr Thr Val 

10 15 
Lys Glu Gin Arg Leu Ala Thr Asp 

25 30 
Glu 



<210> 139 
<211> 100 
<212> PRT 
<213> Homo sapiens 



<400> 139 
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Met 


Gin 


Lvs 


Arcr 


Leu 


Lys 


Val Val 


Thr 


Thr 


Val Leu Thr Asp 


Ala 


Ser 


1 








5 








10 




15 




Lys 


Gly 


Ser 


Leu 


Asp 


Gin 


Gly Ser 


Glu 


Ala 


Thr Ser Ala Asn 


Ser 


Leu 








20 








25 




30 






Cys 


Gly 


Ala 


Cys 


Val 


Cys 


Ala Ser 


Ser 


Glu 


Leu Arg Ser His 


Gly 


Leu 






35 








40 






45 






Ser 


Arg 


Ser 


Asp 


Gly 


Ser 


Ser Asp 


Ser 


Phe 


His Val Pro Trp 


Leu 


His 




50 










55 






60 






Gly 


Ala 


His 


Ala 


Leu 


Val 


Leu Leu 


Pro 


Asn 


Ala Gly Ala Ala 


Glu 


Ser 


65 










70 








75 




80 


Pro 


Leu 


Ala 


Arg 


Pro 


His 


Pro Arg 


Glu 


Thr 


His Val Gly Ala 


Val 


Pro 










85 








90 




95 




Glu 


Asp 


Ser 


Leu 



















100 



<210> 140 
<211> 53 
<212> PRT 

<213> Homo sapiens 



<400> 140 

Met Ser Leu Lys Gin Ala Leu Gin Ser Pro Gly Leu Phe Leu Pro Ala 

15 10 15 

Leu Ala Pro Ala Pro Ser Ser Ser Cys Val Gin Asn Gin Ser Gin Arg 

20 25 30 

Cys Gly Leu Asn Pro Gly Ser Cys Pro Asp Thr Trp Leu Thr Ser Ser 

35 40 45 

Phe Ser Leu Pro Ser 
50 



<210> 141 
<211> 419 
<212> PRT 

<213> Homo sapiens 



<400> 141 



Met 


Glu 


Ala 


Ala 


Asp 


Val 


Ala His 


1 








5 






Gly 


Gly 


Phe 


Arg 


Asn 


He 


Ala He 








20 








His 


Phe 


Pro 


val 


Phe 


Lys 


Arg Lys 






35 








40 


Val 


Glu 


Met 


Pro 


Thr 


Ala 


Ser Gin 




50 










55 


Leu 


Ser 


Glu 


Gin 


Asp 


lie 


Val Ser 


65 










70 




Ala 


Gin 


His 


Leu 


Glu 


Asp 


Gly Pro 










85 






Phe 


Leu 


Pro 


Arg 


Val 


Pro 


Lys Val 








100 








Cys 


Leu 


Glu 


Ala 


Gly 


Thr 


Trp Arg 






115 








120 


lie 


Gin 


Val 


Pro 


Asp 


Asp 


He Tyr 




130 










135 


Glu 


Ser 


Ala 


Leu 


Ser 


Pro 


Asp Arg 


145 










150 




His 


Pro 


Leu 


Asp 


Asp 


Thr 


Arg Pro 



165 



Arg 


Ala 


His Met Ser Gin 


Lys 


Ala 




10 




15 




Gin 


Thr 


Ser Pro Ser Leu 


Arg 


Lys 


25 




30 






Arg 


Leu 


Thr Ala Ser Lys 


Ser 


Leu 






45 






Ser 


Ala 


He Gin Val Asn 


Gly 


Asn 






60 






Ser 


Asp 


Leu Ala Tyr Leu 


Arg 


Leu 






75 




80 


Arg 


Arg 


Val Lys Val Ser 


His 


Ala 




90 




95 




Gin 


Ser 


Asn Gly Pro Val 


Ser 


He 


105 




110 






Ser 


Leu 


Glu Lys Ala Thr 


Ala 


Ala 






125 






His 


Ser 


Pro Ser Trp Glu 


Ala 


Arg 






140 






Ser 


Ala 


Glu Val Ser Asn 


Ser 


lie 






155 




160 


Gly 


Asp 


Gly Arg Arg Val 


Thr 


Pro 




170 




175 





82 



WO 01/74836 



Leu 


Asp 


Ser 


Glu 


Lvs 


Ser 


Thr 


Ser 


Cvs 








180 










185 


Ser 


His 


Thr 


Pro 


Gly 


Thr 


Glu 


Glu 


Leu 






195 










200 




Lys 


Asp 


Asn 


Ser 


Asp 


Asp 


Lys 


Asp 


Leu 




210 










215 






Lys 


Glu 


Thr 


Cys 


Val 


Pro 


Ser 


Ser 


Pro 


225 










230 








Gin 


Gly 


Ser 


His 


Ser 


Gin 


Pro 


Ala 


His 










245 










Pro 


Ser 


Ser 


Ser 


Asn 


Asn 


His 


Gin 


Asn 








260 










265 


Ser 


Ala 


Ser 


Lys 


Ser 


Ala 


Pro 


Gly 


Cys 






275 










280 




Pro 


Thr 


Glu 


Ser 


Asp 


Thr 


Leu 


Glu 


Phe 




290 










295 






His 


Leu 


Pro 


Ser 


Ser 


Leu 


Ser 


Arq 


Ser 


305 










310 








Arg 


Glu 


He 


Ser 


Asp 

XT 


He 


Asn 


Gin 


He 










325 










Cvs 


Asp 


Leu 


Gin 


Gly 


Arq 


Leu 


Gin 


Ser 








340 










345 


Asn 


Gin 


Glu 


Lvs 


He 


Lvs 


Val 


Leu 


Leu 






355 










360 




Lys 


Ala 


Arg 


Ala 


Leu 


Thr 


Glu 


Gly 


Leu 




370 










375 






Glu 


His 


Leu 


Asp 


Thr 


Ser 


Tyr 


Leu 


Thr 


385 










390 








Met 


Pro 


Arg 


Asp 


Ser 


He 


His 


Glv 


Ser 










405 










Val 


His 


Lys 
















<210> 142 














<211> 270 














<212> PRT 














<213> Homo 


sapiens 










<400> 142 












Met 


Asp 


Ser 


Gin 


Glu 


Val 


Glu 


Lvs 


Tvr 


l 








5 










Glu 


Glu 


He 


Pro 


Phe 


Ser 


Glv 


He 


His 








20 










25 


Ala 


Leu 


Glu 


His 


Gly 


Lys 


Asp 


Asp 


Leu 






35 










40 




Leu 


Phe 


Cys 


Phe 


Ser 


Ser 


Phe 


Ser 


Asn 




50 










55 






Lys 


Val 


Phe 


Leu 


Lys 


Ash 


He 


His 


lie 


65 










70 








Phe 


His 


Leu 


Leu 


Thr 


Met 


Ala 


Lys 


Ala 










85 










Val 


Glu 


Pro 


Val 


Ser 


Pro 


Ser 


Gin 


Pro 








100 










105 


Leu 


Glu 


Glu 


Leu 


Trp 


Thr 


Arg 


Leu 


Thr 






115 










120 




Gin 


Arg 


Asp 


Arg 


Glu 


Lys 


Gly 


Gly 


Trp 




130 










135 






His 


Gin 


Gly 


Val 


Gly 


Lys 


Gin 


Glu 


Glu 


145 










150 








Lys 


Ala 


Gly 


Asn 


Trp 


Gly 


Asp 


Gly 


Arg 
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Leu 


Asn 


Ala 


Thr 


Ser 


Val 


Ala 










190 






Lvs 


Pro 


Glu 


Leu 


Leu 


Leu 


Pro 








205 








Gly 


Ser 


Leu 


Ser 


Ser 


Gin 


Ser 






220 










Arg 


Thr 


His 


Ser 


Ser 


Pro 


Ser 




235 










240 


Pro 


Glv 


Arq 


Ala 


Ser 


Asp 


Cys 


250 










255 




Leu 


Val 


Ser 


Leu 


Lvs 

2 


Thr 


Asn 










270 






Gin 


Glu 


Gin 


Thr 


Ala 


Asn 


Asn 








285 








Pro 


Asn 


Cys 


Pro 


Glv 


Ser 


Asn 






300 










Glu 


Thr 


Lvs 

2 a 


Leu 


Gin 


Ser 


Asn 




315 










320 


His 


Leu 


Ala 


Arcr 


Gly 


Glu 


Leu 


330 










335 




Val 


Glu 


Glu 


Ser 


Leu 


His 


Ser 










350 






Asn 


Val 


He 


Gin 


Asp 


Leu 


Glu 








365 








Leu 


Gly 


Ser 


Pro 


Leu 


Thr 


He 






380 










Lys 


Ser 


Thr 


Asp 


Pro 


Thr 


Pro 




395 










400 


Gin 


Glu 


Leu 


Ala 


Glv 


He 


Ser 


410 










415 




Pro 


Asn 


Thr 


Ser 


Val 


Ala 


Cys 


10 










15 




Val 


Ala 


Glv 


Glv 


Lys 


Ser 


Glv 










30 






Asp 


Glu 


Pro 


He 


Glu 


Asn 


Pro 








45 








Ala 


Leu 


Ala 


He 


Leu 


Leu 


Pro 






60 










Leu 


Gin 


Phe 


He 


Tvr 


Arq 


Ser 




75 










80 


Lvs 


Phe 


Glu 


Glv 


Ala 


Glu 


Ser 


90 










95 




Lys 


Arq 


Pro 


Ser 


Tvr 


Val 


Pro 










110 






Lvs 

2 


Glv 


Asn 


Ser 


Aro 


Pro 


Gin 








125 








Met 


Lys 


Gly 


Val 


Gin 


Gin 


Gly 






140 










Gly 


Ser 


Glu 


Asn 


He 


Lys 


Glu 




155 










160 


Arg 


Trp 


Gly 


Arg 


Asn 


Gin 


Gin 


170 










175 
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Val 


Leu He Asn 


Lys He Thr 


Phe Lys 




180 




185 


Ser 


Asn Leu Val 


Phe Pro Glu 


Glu Ser 




195 




200 


Leu 


Ser Trp Thr 


His He Val 


Leu His 




210 


215 




Lys 


Val Lys He 


Gly Cys Leu 


Leu Arg 


225 




230 




His 


Ala Leu Ser 


Val Thr Ser 


Glu Leu 






245 




Ala 


Gly Phe Asp 


Glu Ala Tyr 


Lys Leu 




260 




265 




<210> 143 








<211> 78 








<212> PRT 








<213> Homo 


sapiens 






<400> 143 






Met 


Gly Leu Cys 


lie Thr Ser 


He Ala 


1 




5 




Pro 


Thr Gly Cys 


Leu Ser He 


Lys Tyr 




20 




25 


Ser 


Phe Arg Ala 


Leu Lys Glu 


Leu Cys 




35 




40 


Leu 


Pro Thr Ser 


Leu Thr Val 


He Leu 




50 


55 




Lys 


Thr Leu Trp 


Leu Phe Val 


Leu He 


65 




70 






<210> 144 








<211> 80 








<212> PRT 








<213> Homo 


sapiens 






<400> 144 






Met 


Thr He Val 


Arg Asn Gly 


Val Arg 


1 




5 




He 


Lys Arg He 


He Arg Glu 


Tyr Tyr 




20 




25 


Phe 


Thr Lys He 


Gly Asn Ala 


Tyr Ala 




35 




40 


Arg 


Lys Gin Tyr 


Asp Leu Thr 


Gly Asn 




50 


55 




Gin 


Asn Asn Gly 


Arg Phe Asn 


Phe His 


65 




70 





<210> 145 
<211> 219 
<212> PRT 
<213> Homo sapiens 



<400> 145 



PCT/US01/10472 



Gin Thr Ser Leu Asn Tyr Thr 
190 

Tyr Phe Ser Asn Ala Tyr Ser 
205 

His Cys Glu He His Leu Arg 
220 

Glu Thr Asn Leu Lys Val Arg 
235 240 
Gly Ala Asp He Ser Arg Leu 
250 255 
Thr Ser He Gly Ser 
270 



Thr Trp Ser Ala Ala Gly Ser 

10 15 
Phe Ser Thr Gly Arg Ser Ser 
30 

Cys Leu Phe Pro Gly Leu Cys 
45 

Ala Gly Asn He Leu Leu Gin 
60 

Ser Phe Pro Pro Leu 
75 



Asp Lys Asp He Thr Ala Asp 

10 15 
Glu Gin Phe His He His Lys 
30 

Val Leu Ser Asn Pro Glu Lys 
45 

Glu Glu Gin Ala Cys Asn His 
60 

Arg Gly Phe Val Lys Leu He 
75 80 
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Met 


Pro 


Lys 


Lys 


Asp Asp Ser 


Asn 


Trp 


1 








5 








Glu 


Gin 


Leu 


Gly 


Gly Pro Ser 


Ala 


Lys 








20 








25 


Thr 


He 


Cys 


Thr 


Trp 


Pro Thr 


Phe 


Ala 






35 








40 




Ser 


Leu 


Pro 


Gly 


Arg Ser Val 


Pro 


Pro 




50 








55 






Val 


Tyr 


Gin 


Pro 


Thr Tyr Ser 


Tyr 


Ser 


65 










70 






Glu 


Lys 


Ala 


Asn 


Lys 


Arg Pro 


Thr 


Pro 










85 








Pro 


Leu 


Gly 


Leu 


Val 


Asp Asp 


Ala 


Phe 








100 








105 


Ser 


He 


Val 


Ala 


Abu 


Val Val 


Ser 


Ala 






115 








12 0 




Gly 


Pro 


Pro 


Glu 


Asp Trp Asn 


Glu 


Arg 




130 








135 






Gin 


Lys 


Ala 


Tyr 


Met Gly Ser 


Cys 


Gly 


145 










150 






Ala 


Asn 


Asn 


Asn 


Met 


Ser Thr 


Thr 


Ser 










165 








Tyr 


Ala 


Tyr 


Tyr 


Arg Gin Ala 


Ser 


Arg 








180 








185 


Met 


Glu 


Thr 


Arg 


Ser 


Asp Tyr 


Cys 


Leu 






195 








200 




Asn 


Pro 


His 


Pro 


Arg Gly Trp 


Arg 


Arg 




210 








215 








<210> 146 












<211> 214 












<212> PRT 












<213> Homo 


sapiens 








<400> 146 










Met 


Gly 


Ala 


Lys 


Ala 


Asp Ser 


Leu 


Ala 


l 








5 








Arg 


Gly 


Ser 


Gin 


Arg 


Gin Phe 


Ala 


Lys 








20 








25 


Ser 


Phe 


Gly 


Gly 


Lys 


His Ala 


Val 


Ser 






35 








40 




Gin 


His 


His 


Pro 


Arg 


Gly Arg 


Leu 


Leu 




50 








55 






Leu 


Ala 


Asn 


Pro 


Lys 


Ala Thr 


His 


Val 


65 










70 






His 


Arg 


Cys 


Thr 


Ala 


Asp Leu 


Ser 


Ala 










85 








Phe 


Pro 


Cys 


Thr 


Lys 


Glu Leu 


Ser 


Lys 








100 








105 


His 


Gly 


Gly 


Ser 


Arg 


Pro Ala 


Asp 


Lys 






115 








120 




Leu 


Glu 


Ala 


Glu 


Ser 


Pro Lys 


Gin 


Ser 




130 








135 






Arg 


His 


Asn 


Leu 


Ala 


Ser Arg 


Pro 


Gly 


145 










150 






Gly 


Thr 


Met 


Gin 


Gly 


Arg Pro 


Leu 


Ser 










165 








Thr 


Thr 


Phe 


Val 


Thr 


His Ser 


Val 


Pro 








180 








185 


Glu 


Arg 


Gly 


He 


Gly 


Asp Leu 


Glu 


His 
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Gly 


Tyr 


Ala 


Phe 


Thr 


Asp 


Gly 


10 










15 




Gly 


Lys 


He 


Lys 


Trp 


Gin 


Pro 










30 






Lys 


Glu 


Ser 


Gly 


Arg 


Ser 


Leu 








45 








Pro 


He 


Ser 


Thr 


Ser 


Pro 


Trp 






60 










Ser 


Lys 


Pro 


Thr 


Asp 


Gly 


Leu 




75 










80 


Trp 


Glu 


Ala 


Ala 


Ala 


Lys 


Ser 


90 










95 




Gin 


Pro 


Arg 


Asn 


He 


Gin 


Glu 










110 






Ala 


Arg 


Arg 


Lys 


Val 


Leu 


Pro 








125 








Leu 


Ser 


Tyr 


He 


Pro 


Gin 


Thr 






140 










Arg 


Gin 


Glu 


Tyr 


Asn 


val 


Thr 




155 










160 


Gin 


Tyr 


Gly 


Ser 


Gin 


Leu 


Pro 


170 










175 




Asn 


Asp 


Ser 


Ala 


He 


Met 


Ser 










190 






Pro 


Val 


Ala 


Asp 


Tyr 


Asn 


Tyr 








205 








Gin 


Thr 












Cys 


Phe 


Ser 


Asp 


Glu 


Arg 


Ser 


10 










15 




Glu 


Asp 


Gin 


Ser 


Leu 


Arg 


Arg 










30 






Ala 


Val 


Gly 


Gly 


Ser 


Ala 


Tyr 








45 








Asp 


Ala 


Asp 


Lys 


Phe 


His 


Pro 






60 










Tyr 


Met 


Gly 


Arg 


Ala 


Val 


Arg 




75 










80 


Leu 


Pro 


Leu 


Met 


Phe 


Leu 


Leu 


90 










95 




Gin 


Gin 


Val 


Arg 


Arg 


Gin 


Gly 










110 






Asp 


Leu 


Glu 


Glu 


Gly 


Gly 


Gly 








125 








Pro 


Asn 


Leu 


Cys 


Val 


He 


Leu 






140 










Gin 


Leu 


Ala 


Leu 


Val 


Thr 


Val 




155 










160 


His 


Ser 


Ser 


Glu 


Val 


Lys 


Gly 


170 










175 




Ala 


Gly 


Lys 


Glu 


Lys 


Asp 


Glu 










190 






Ala 


Arg 


Asp 


Leu 


Arg 


Asn 


Ser 



205 
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Pro Thr Pro Leu Phe Tyr 
210 



<210> 147 

<211> 125 

<212> PRT 

<213> Homo sapiens 



<400> 147 



Met 


Arg 


Val 


Pro 


Trp Ser Lys Cys Leu Ser Ala Gly Met 


Pro 


Leu Pro 


1 








5 10 




15 


Ser 


Gin 


Arg 


Leu 


Trp Leu Ser Glu Leu He Tyr Leu Lys 


Phe 


Leu Ala 








20 


25 


30 




Ser 


Gly 


Leu 


Phe 


Asn Gly Tyr Pro Asn Pro Glu Asn Phe 


Ser 


Trp Thr 






35 




40 45 






Glu 


Tyr 


Leu 


Glu 


Ala Thr Gin Thr Asn Ala Val Pro Ala 


Lys 


Val Phe 




50 






55 60 






Lys 


Met 


Asp 


Ser 


Asp Val Gly Glu Asn Arg Lys He Leu 


Arg Asp Glu 


65 








70 75 




80 


Arg 


Pro 


Asn 


Tyr 


Ser Gin Tyr Thr Pro Phe Ser Arg Cys 


Asp Asn Ala 










85 90 




95 


Ser 


Tyr 


Lys 


Glu 


Asn Val Phe Leu Gin Lys Leu Glu Arg 


Asn 


Thr Pro 








100 


105 


110 




Asp 


He 


Ala 


Glu 


Arg Phe Asp Cys Leu Leu Leu Thr Tyr 










115 




120 125 







<210> 148 
<211> 126 
<212> PRT 
<213> Homo sapiens 





<400> 148 








Met 


Cys 


Lys 


Ser 


Trp 


Gin Cys Ser Val Asn Ala Gin Leu 


Gin Pro His 


1 








5 


10 


15 


Leu 


Gly 


Leu 


Leu 


Gly 


Ser Leu Trp Val Met Leu Pro Ser 


Leu Val Gin 








20 




25 


30 


Leu 


Ala 


He 


Ala 


Arg 


Arg Lys Val Trp Pro Phe Gly Pro 


Glu Pro Gly 






35 






40 45 




His 


Leu 


He 


Ser 


Glu 


Phe Gly His Phe Glu Gly Thr Val 


Leu Asp Lys 




50 








55 60 




Pro 


Ser 


Trp 


Val 


Thr 


Cys Arg He Leu Gly Gly Gly Arg 


Leu Gin Gly 


65 










70 75 


80 


Met 


Gly 


Trp 


Leu 


Thr 


He Asp Phe Ser Pro Gin Ala Ser 


Lys Arg Ala 










85 


90 


95 


Leu 


Thr 


Ser 


Ser 


He 


Met Glu Leu Gly Pro Gly Val Ser 


Lys Leu Val 








100 




105 


110 


Gin 


Trp 


Gin 


Leu 


Glu 


Asn Leu Lys Arg His Cys Gly Pro 


Leu 






115 






120 125 





<210> 149 

<211> 53 

<212> PRT 

<213> Homo sapiens 



<400> 149 



86 
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Met Arg Lys Pro Arg Leu Val Lys 

1 5 
His Arg Leu Arg Asp Lys Ala Arg 
20 

Leu Glu Cys Val Asp Ser Pro Leu 
35 40 
Cys Lys Lys Lys Lys 
50 



He Leu Gin Val Ala Gin Gly Pro 

10 15 
Leu Gin He Gin Ala Ser Val Thr 

25 30 
Gin Tyr Thr Thr Ser His His Pro 
45 



<210> 150 

<211> 86 

<212> PRT 

<213> Homo sapiens 



<400> 150 

Met Phe Leu Glu Arg He Gly He Val Leu Cys Glu Arg Glu Phe His 

15 10 15 

Phe Glu Phe Leu Pro Gly Val Val Phe Gly Phe Val Lys Cys Phe Thr 

20 25 30 

Val Met Val Val Leu Arg Lys Cys He Gin Ser Leu Leu Tyr Phe Leu 

35 • 40 45 

Cys Thr Ala Asn Phe Met Gly Leu Pro Ser Arg Ala Gly Thr Glu Leu 

50 55 60 

Ala Ala Tyr Thr Asn Arg Asn Ser Ser Thr Pro Pro Asp Thr Ala Glu 
65 70 75 80 

Ala Ser Pro Cys Tyr Arg 
85 



<210> 151 
<211> 149 
<212> PRT 
<213> Homo sapiens 



<400> 151 



Met 


Glu 


Ser 


Pro 


Cys Asn Thr Gly 


1 








5 


Thr 


Arg 


Ser 


Gly 


Phe Thr Arg Val 








20 




Pro 


Gly 


Ala 


Gly 


Val Met Gly Val 






35 




40 


Val 


Gly 


Ser 


Asp 


Glu Glu Ala Leu 




50 






55 


Ala 


Phe 


Met 


Val 


Lys Phe Cys Ala 


65 








70 


Asn 


Pro 


Gin 


Thr 


Asp Ala Val Leu 










85 


Lys 


Ser 


Pro 


Lys 


Cys Leu Asn His 








100 




Thr 


Val 


Glu 


Thr 


Asn Ala Leu Ala 






115 




120 


Leu 


Pro 


Leu 


Asn 


Gly Ala Val Arg 




130 






135 


Gly 


Glu 


Lys 


Asn 


Val 



145 



Cys Ser Tyr Gly Asn Cys Tyr Lys 

10 15 
Gly Arg Arg Asn Leu Pro Arg Lys 

25 30 
Asp Val Leu Leu Gly Pro Arg Lys 
45 

Lys Lys Ser Gin Tyr Asp Asp Asp 
60 

Ser Ser Thr Glu His Phe Leu Asn 
75 80 
Gly Gin Arg Tyr Gly Arg Glu Ser 

90 95 
Lys His Arg Asp Leu Thr Pro Val 
105 110 
Ser Asp Pro Tyr Arg Ser Ala Phe 
125 

Gly Glu Leu Gly Cys Arg Thr Gly 
140 



<210> 152 
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<211> 48 
<212> PRT 

<213> Homo sapiens 



<400> 152 
Met Met Gly Met 
1 

Glu Glu Pro Leu 
20 

Gly Val Val His 
35 



Tyr Leu Lys Pro 
5 

Lys Asp Glu Lys 

Cys Asp Ser Ser 
40 



Ser Glu Ser His 
10 

Asp Lys Gly Val 
25 

Ala Val Ala Leu 



Pro Asp Ser Gly 
15 

Thr Arg Gly Gly 
30 

Cys Val His Phe 
45 



<210> 153 
<211> 30 
<212> PRT 

<213> Homo sapiens 



<400> 153 

Met His Glu Ser Asn Ala lie Arg lie Thr Val Glu Leu Phe Tyr Leu 

15 10 15 

Tyr Ala Ser Cys Arg Cys Leu Glu Val Leu His Leu Leu Cys 
20 25 30 



<210> 154 
<:211> 82 
<212> PRT 

<213> Homo sapiens 



<400> 154 



Met 


Cys 


Leu 


Ser 


Met Gin Pro His 


1 








5 


His 


Leu 


Ser 


Lys 


Thr Arg Ala Ala 








20 




Trp 


Gly 


Trp 


Ser 


Val Lys Thr Gin 






35 




40 


Ser 


Lys 


Pro 


Ser 


Pro Arg Thr Arg 




50 






55 


Ala 


Arg 


Leu 


Ala 


Ala Lys Val Gin 


65 








70 


Asn 


Ser 









Gin Leu Pro Tyr Phe Ser Cys Leu 

10 15 
His Pro Ala Pro Pro Asp Phe Arg 

25 30 
Ala Arg Arg Lys Thr Pro Ser Pro 
45 

Pro Met Cys Arg Pro Lys Ser Lys 
60 

Ala Val Arg Lys Arg Arg Ser Gly 
75 80 



<210> 155 
<211> 71 
<212> PRT 

<213> Homo sapiens 



<400> 155 

Met Pro Gly Pro Ala Ala Ala Ser His Arg Ala Ser Thr Tyr Val Ser 

15 10 15 

Thr Trp Ser Cys Pro Pro His His Ser Trp His Ala Trp Gin Cys Thr 
20 25 30 
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Val Ala Arg Pro His Leu Gin Thr Ser His Cys Cys Thr Ser Gly Leu 

35 40 45 

Pro Leu Ala Asp Met Glu Ser Arg Leu Val Ala Ser Pro Ser Glu Trp 

50 55 60 

Asn Lys Leu Thr Trp Ala Gin 
65 70 



<210> 156 
<211> 42 
<212> PRT 

<213> Homo sapiens 



<400> 156 

Met lie Ser Ser Ala Phe Leu Leu Leu Thr Leu lie Arg Ser Tyr Leu 

15 10 15 

Leu Leu Arg Tyr Gin Asn Thr Thr Ser Met Thr Glu Leu Asp Pro Arg 

20 25 30 

Arg Leu His Cys Thr He Trp Met Glu Val 
35 40 



<210> 157 

<211> 58 

<212> PRT 

<213> Homo sapiens 



<400> 157 

Met Ser Arg Ala Pro Gly He Leu Ala Ser Trp Arg Arg Ala Pro Ser 

1 5 10 ~ 15 

Leu Ser Val Gin Lys Gly Ser Gin Thr Ala Arg His Thr Cys His Pro 

20 25 30 

Glu Val Pro Leu Gly Asn Cys Phe Leu Pro Val Tyr Lys Ala Ser Pro 

35 40 45 

Leu Thr Val Thr Arg Leu Trp Ala Glu Arg 
50 55 



<210> 158 

<211> 85 

<212> PRT 

<213> Homo sapiens 



<400> 158 

Met Gin Arg Ala Leu Arg Arg Asp Thr Gly Gly Ser Gin Ser Asp Tyr 

15 10 15 

Pro Leu Ala Leu Ser Ser Phe Pro Thr Ala Gly Ser Ser Cys Pro Ala 

20 25 30 

Leu Gly Leu Thr Thr Pro Leu Ala Trp Arg Leu Cys Leu He Phe He 

35 40 45 

Thr Gly Arg Glu Thr Arg Ser He Pro Lys Glu Arg Glu Pro Arg Met 

50 55 60 

Pro Glu Arg Lys Ala Ser Pro Arg Arg Trp Asp Asn Phe Pro Tyr Lys 
65 70 75 80 

Val Trp Gin Ser Ser 
85 
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<210> 159 
<211> 82 
<212> PRT 

<213> Homo sapiens 



<400> 159 



Met 


Leu 


Leu Ala 


Asp Lys Glu Ala 


1 






5 


Pro 


Asn 


Asp Ala 


Asn Tyr Pro Arg 






20 




Ala 


lie 


Gly Phe 


Gly Val Arg Lys 






35 


40 


Gin 


Gin 


Cys Ala 


Gly Asp Phe Leu 




50 




55 


Gin 


He 


Ser Val 


Ser Val Trp Ala 


65 






70 


Asn 


Ser 







Ser Glu Ala Gly Leu Thr Asn Val 

10 15 
Tyr Ser Pro Ala Glu Cys Leu Met 

25 30 
Ser Arg Leu Glu His Leu His Leu 
45 

Ser Ala Ser Leu Ser Pro Thr Asn 
60 

His He His Lys Leu Ala Arg Gly 
75 80 



<210> 160 
<211> 27 
<212> PRT 

<213> Homo sapiens 



<400> 160 

Met Gly Ala Ala Ala Lys Gly Ser Leu Trp Ala Ser Pro Phe Ser Arg 

1 5 10 15 

Ser Val Leu Pro Leu Cys Leu Gly Ser Ser Leu 
20 25 



<210> 161 

<211> 90 

<212> PRT 

<213> Homo sapiens 



<400> 161 



Met 


Trp 


Asn Met 


Leu 


Ser Asp Ser 


1 






5 




Ser 


Ala 


Arg Val 


Leu 


Ser Phe Gly 






20 






Thr 


Ser 


Ser Phe 


Thr 


Leu Asn Ser 






35 




40 


Phe 


Val 


Thr Thr 


Thr 


He Phe Gin 




50 






55 


His 


Pro 


Leu Lys 


Phe 


Tyr Thr Arg 


65 








70 


Phe 


Leu 


Leu Ala 


Ala 


Ser Lys Glu 



85 



Asn Tyr Gin Gin Phe Pro Ser Asp 

10 15 
Ser Arg Val Leu Tyr Ser Lys Ser 

25 30 
Gin Asp Leu Tyr Ser Trp Thr Gin 
45 

He Cys Ser His Lys His Pro Lys 
60 

He Thr Glu Lys His Pro Ala Asp 
75 80 

Gin Leu 
90 



<210> 162 

<211> 66 

<212> PRT 

<213> Homo sapiens 



90 
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<400> 162 



Met 


His 


Cys 


Ala 


Val Cys Met Val 


1 








5 


Phe 


Ser 


Pro 


Lys 


lie Leu Lys Arg 








20 




Thr 


Gly 


lie 


Lys 


Glu Asp Gin lie 






35 




40 


Phe 


Pro 


Leu 


Pro 


Leu Ala Thr Ser 




50 






55 


Trp 


Thr 









Ser Phe Gin Pro Pro Pro Lys Trp 

10 15 
Asn Phe Met Phe Leu Cys lie Phe 

25 30 
His Asn Phe Trp Gin His Cys Ser 
45 

Leu Trp Thr Trp Glu Thr Ser Leu 
60 



<210> 163 

<211> 75 

<212> PRT 

<213> Homo sapiens 



<400> 163 



Met 


Thr 


Ser Arg 


Glu 


Asn Thr Asn 


1 








5 




Arg 


Ser 


Ser 


Leu 


His 


Arg Cys Phe 








20 






Ser 


Arg His 


Arg 


Leu Arg Thr Asn 






35 






40 


Trp 


Pro 


Met 


Gly 


Trp 


Ser Val Cys 




50 








55 


Met 


Ser 


Ser 


Ser 


Ala 


Gin Arg Leu 


65 










70 



Gly lie Asn Ser Ala Val Ser Phe 

10 15 
Ser Lys Leu Met Phe Ser Glu Asn 

25 30 
Pro Pro Ser Pro Phe Arg Leu Gly 

45 

Val Leu Val Gly Pro Ala Val Tyr 
60 

His Ser His 
75 



<210> 164 

<211> 46 

<212> PRT 

<213> Homo sapiens 



<400> 164 
Met lie Leu Thr Thr Leu Met Tyr 

1 5 
lie Gly Phe Ala Phe Gly Thr Pro 
20 

Phe Leu His Ser Lys Gin Glu Gly 
35 40 



Gly Tyr Tyr Tyr Leu Asp Asp His 

10 15 
Arg Ser Leu Ser Leu Ser Ser Asp 

25 30 
Tyr Phe Ser Pro Thr Leu 
45 



<210> 165 
<211> 49 
<212> PRT 

<213> Homo sapiens 



<400> 165 
Met Ser Ser Val Gly Ser Thr Glu 

1 5 
Arg Gly lie Gly Val Ser Glu Arg 
20 

Thr Glu Leu Ser Ala Asp Cys Asn 
35 40 



Ala Gly Arg Ser Glu Gin Ala Glu 

10 15 
Val Gly Val Gin Ala Gin Asp Arg 

25 30 
Thr Val Thr Pro Lys Val Lys Ala 
45 
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Ser 



<210> 166 

<211> 56 

<212> PRT 

<213> Homo sapiens 



<400> 166 
Met Val His Arg Pro His He Ser 

1 5 
Arg Ser Arg Leu His Thr Phe Arg 
20 

Asn Gly Ser Thr Arg Leu Asp Leu 
35 40 
Arg Leu Gin Arg Val Ala Leu Asp 
50 55 



Leu Gin Leu Asn Phe Gly Pro Lys 

10 15 
Asn Lys Ser Tyr Ser Ser Ala Gin 

25 30 
Pro Thr Arg Pro Ala Trp Pro Leu 
45 



<210> 167 

<211> 8 

<212> PRT 

<213> Homo sapiens 



<400> 167 
Met Met Arg Lys Ser Leu Pro His 
1 5 



<210> 168 

<211> 2067 

<212> DNA 

<213> Homo sapiens 



<400> 168 

ggaggcagag ctctgggaat ctcaccatgg cctggacccc tctcctgctc cccctcctca 60 

ctttctgcac agtctctgag gcctcctatg agttgacaca gccaccctcg gtgtcagtgt 120 

ccccaggaca aacggccacg atcacctgct ctggagatgc attgccaaaa aaaacatcct 180 

tattggtacc agcagaagtc aggccaggcc cctgtactgg tcatctatga ggacaacaaa 240 

cgaccctccg ggatccctgc agagattctc tgcctccagc tcagggacaa tggccacctt 300 

gactatcagt ggggcccagg tggaggatga agcggactac tactgctact caacagacag 360 

cagtggtaat catagggggg tgttcggtgg agggaccagg ctgaccgtcc tcagtcagcc 420 

caaggctgcc ccctcggtca ctctgttccc accctcctct gaggagcttc aagccaacaa 480 

ggccacactg gtgtgtctca taagtgactt ctacccggga gccgtgacag tggcctggaa 540 

ggcagatagc agccccgtca aggcgggagt ggagaccacc acacccggca aacaaagcaa 600 

caacaagtac gcggccagca gctacctgag cgctgacgcc tgagcagtgg aagtcccaca 660 

aaagctacag ctgccaggtc acgcatgaag gggagcaccg tggaggagac aggtgcccct 720 

acagaatatt tattgagggt ttattgagtg cagggagaag ggtcttgatg cccttggggt 780 

gggaggagaa gacccctctc ccctggggat tcctgcaggt tctagttctt cccgtggggg 840 

gggagtgagg ggttgagaaa cctatggaac attcgtgtag ggggcccctg tcttctccca 900 

cggtgctccc cttcaatgcc gtgaccctgg caagctgtag ctttctgtgg aactttccac 960 

ctggtcaggg cgtcaggctc agatagcctg ctgggccgcg tacttcgttg ttgctttgtt 102 0 

ttggagggag tggatggtct ccatctcccg ccattgacgg gggctgctat gctgccttcc 108 0 

caggccactg tacacggctt cccgggtaag aagtcactta tggagacaca ccaagtggtg 1140 

gcccttgttg ggcttgaaag ctccctcagc agggagggct gggaacaaga gtgacccgag 1200 

gggagcagcc tttgggtgta ccatagggac gggtcagctt tggtcccctc cgccgaacac 1260 

agcacacagt gacacagccc cacacacaaa cccctcctga cacacccgac ccgttccccg 1320 
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ctgcagcagc ttgggcgccc aggtccagtg gccctgggtc cagagggtca gggtccagtc 138 0 

tgtgcctgct tcagagatag atgtgggggt agagggcagg gtcttgttgt tgctctgttt 1440 

ggagggtgtg gtggtctcca cfccccgcgtt gacggggctg ccatctgcct tccaggccac 1500 

ttttacagct cccgggtaga agtcactgat caggcacacc agtgtggcct tgttggcttg 1560 

aagctcctca gaggagggcg ggaacagagt gaccgagggg gcagccttgg gctgaccgag 162 0 

gacggtcacc ttggtgccac tgccgaacac attacactgt gatataacca gggcagcagg 168 0 

aagcacaaca gagacgtggg taagaatgaa gatgtctggt tccttctgaa gactatttcc 1740 

ttgttctgga acattctggc agctgtagct tctgtgggac ttccactgct cgggcgtcag 1800 

gctcaggtag ctgctggccg catagttgtt gttgctctgt gtggagggtg tggttgtctc 1860 

tatgcccttg gtgactaggg tgccattttc cttcagcatg gccttgttgg cttggagctc 192 0 

ctcagaggag ggcgggaaca gagtgaccaa gggggtggcc ttgagcggac ctgtgagttt 198 0 

ccattcccag acctttccac aagttcctgg tgatggcacc accatggact tctcacccga 204 0 

taaactgggc acctggattt tcctcat 2067 



<210> 169 
<211> 1692 
<212> DNA 

<213> Homo sapiens 
<400> 169 

atggagaaag gcagcggctt cataaagtac tcaacttaca aacaagggac aattagagtt 60 

gcagaagaag cagaaacagc tcacagctct gtcttaattg gtccagaaaa gggcgtggtg 12 0 

cacctggcga cagcggccgt cctaaacgcg gtgtgggact tgtgggccaa gcaggaggga 180 

aaggttttgg cagtaggtag ggaactgcag gaggaggaga aagaggagac aggatggcgg 240 

aaggcgcagg cagcagtaga ggggggtgtg gggacctggt ggctgacagc cagcattaga 300 

gctgccaacg cgtttactgt caggaaaaaa tggggacttt acacatatgt cttacaaatc 360 

ctttcttttt tacttcaagc ctgtctggaa gttacttgtg gacatgattt gataatgggc 420 

tgcattaaaa gtaaagaaaa caaaagtcca gccattaaat acagacctga aaatactcca 480 

gagcctgtca gtacaagtgt gagccattat ggagcagaac ccactacagt gtcaccatgt 540 

ccgtcatctt cagcaaaggg aacagcagtt aatttcagca gtctttccat gacaccattt 600 

ggaggatcct caggggtaac gccttttgga ggtgcatctt cctcattttc agtggtgcca 660 

agttcatatc ctgctggttt aacaggtggt gttactatat ttgtggcctt atatgattat 720 

gaagctagaa ctacagaaga cctttcattt aagaagggtg aaagatttca aataattaac 780 

aatacaccaa tggttctcaa cctgggacag aatcacccag gggatatttg gcagtatctg 840 

gagacatttt tggttgtcac agttggggtg ctgccgctgt cttctagtgc ctcaactcct 900 

gtatttgatc gagtcaccaa tggagtaact ccaactatca aggatctcac tggctgctgt 960 

gttgagaaca gactactgac ttctaatagc agcgacttct ttaccttgat aaaccacagc 102 0 

aactcctcca aaacaccctt ccagaacaca cgtttggttg tcagcagagg gaattcatct 1080 

gaaaaacaat ttgccatacg gtttcaagat gggaagacag atcatgccat ccaactttct 1140 

tcagggaaga aaactgccct gggacgtgag gccctggagc atccagaaag cctagactcc 1200 

cgcaaagtgg gccagagatc acggtggagc tcccaggctg caagtcccat cagtggcccg 1260 

attcaggctg agacagcact tctgtgcccc ggtgaccagt ggacacagga attccacaca 132 0 

tctcctttgc taactgttcc tcatctccct gacatctaca cactggactg ttgccgaaaa 1380 

gacttctcca tttacattca ttcctttggt gatatcaccc agtcttatat ctttaaatac 1440 

cacctgcaaa ttgatgatta ccagctctgt gcccaggcgc tgaaggatgg ctggaccagg 1500 

ccacctcctt tccacactgc tcatctgcac ttctcccttc tgactctcgc ctgtgcagag 1560 

acggtcactt ctgcagaaac tccagatgcc cttgccaagt ccaggtttaa agtaaaggtg 1620 

ggtgctgatc tccaggatga catgcgaaga tgccaaatca tccgagacat gattggaccg 1680 

gaaaagactt tg 1692 



<210> 170 
<211> 949 
<212> DNA 
<213> Homo sapiens 



<400> 170 

ggcacgaggg acgccaccgc cgaggaaaac cgtgtactat tagccatggt caaccccacc 60 
gtgttcttcg acattgccgt cgacggcgag cccttgggcc gcgtctcctt tgaggtaagg 120 
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ggcctggata ccaagaagtg actgctcatc taatccataa agctatgtta acagattgga 180 

ggtagtagca ttttcattac aagtgactaa aagaacagct gtttacccct gatcgtgcag 240 

cagtgcttgc tgttccttag aattttgcct tctgtttgca gacaaggtcc caaagacagc 300 

agaaaatttt cgtgctctga gcactggaga gaaaggattt gggttataag gggttccctg 360 

ctttcacaga attattccag ggtttatgtg tcagggtggt gactgtgaag tcaccataat 420 

ggcactggtg gcaagtccat ctacacggag aaatttgaag atgagaaact tcatcctaaa 480 

agcatacggg gtcctgggat ctttgtccat ggcaaatgct tggacccaac acaaattggt 540 

tcccagtttt tcatctgcac tgccaagact gagtggttgg gatggcaagc cttgtggtgt 600 

ttggcaaagt gaaagaaggc atgaatattg tggaggccat ggagcgcttt gggtccagga 660 

atggcaagac cagcaagaag atcatcagca ttgctgactg tggacaactc gaataagttt 720 

gacttgtgtt tttatcttaa ccaccagatc atttcttttt gtagctcagg agagcacccc 780 

tccaccccat ttgctcgcag gatcctaaga attttgtggc tctcgctgca gttcctttgg 840 

ggttccatgt tttccttgtt ccctcccatg cctagctgga ttgcagcagt taagcttatg 900 

attatgaaat aaaaactaaa taactcacct acaacggatg taattaacg 949 



<210> 171 

<211> 2331 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (1) . . . (2331) 
<223> n = a,t,c or g 



<400> 171 

gccctttttt tttttttttg gtcatttata ttttatttaa gagctgtgcc cagttttatc 60 
atctcacaag aatgaagcaa gggacaaagg taagtgccac gctccctggc cactgggttc . 120 

ctggcaagct cccagccact aggtgccaat ctcccttcaa tgtactcctt cttccccaga 180 

gtgcagaagc gtatgaagac agttatgaca tggacacatg catgagctat tatgcataat 240 

tacaaaagct gaafcctgtca tcaaccaaca tctttgtctc atcagtagga gcgaaatggc 300 

tggccgggac ggtggcacag tcagcctcgt tcaaagtttt gtcgattacg ggtctatatc 360 

ccagggtgac catgaaaaga agagtcttcc cagtggccaa cgtccacata ggacagggtg 420 

tgcttcctcc agtggctcct caaaagggct tcttctgttg cccctggatg ggcttggagt 480 

aatcgtactc atcaatccgc accttgtagt cttccctggc atgcgcgccc cgtgactcct 540 

tccgcgcctc tggctccgta gatggtcttg aagcgcacac agccatcagg ttctgcagct 600 

ccagggtctn cacccaggtc ccgtgttcca gaccattccc cggtcgaacg tcttcaggtg 660 

ctttaggtct ccatagagct tgctgatttt cccacaaacc ttctttggaa acaacgcttc 720 

cccacacggg aacacggcag catgattttg cattgacttc tgcatgctga gtcgcagttc 780 

cgatgttctt atgcttccat cagcaaatct caatttgtca agattcatga cagattcttc 840 

cccagcgttt ggtttaattg gagggacttt atctccaggc ctgcatgact cttcgatgct 900 

cagggcacat gcccgaccaa agacaaccag gtccaagagc gagtttgccc cgaggcggtt 960 

ggcaccatgt accgaggcac aggcggcctc cccacaggcg tacaggccgg gcacaatctg 1020 

atcctggcca ttcacgtgcc tcaggacctg ccccttgtag ttggtgggaa tgccgcccat 1080 

gttataatgc acggtgggga ggacagggat cggctccttc gtgacgtcca cgccagcgaa 1140 

gatcatggct gtctctgaaa tgccaggcag gcgcgtggcc agctgctctg gaggtaggtg 1200 

gtgcagctgc aggtagacgt gatctttctc agggccacag cctcttcctt ctcggatctc 1260 

cagagtcatc gaccgagaca ccacatctct agacgccagg tccttcgcga caggggcgta 1320 

tcgctccata aacctttcgc cttgactgtt aatgagaatg cctccctctc cacgacatcc 1380 

ttccgtaatg agacaaccag caccatatat gcctgtgggg tggaactgaa caaactctag 1440 

gtcctggcaa ggaaggcctg ccctggtgat catggccgtg ccgtcgccag tgctggtgtg 1500 

ggcagacgtg cagctgaagt aggtgcgccc gtagcctcct gtggcaacaa cagtgttctt 1560 

tgctcttatg cgatggatgg acccgtcctc tatgcacagt gcgatgacac cacggcactc 1620 

cccgttctcc atcaggagat ccaaggcaaa atactccaca aaatagctgg tatcatatcg 1680 

cagagaccgt ccatataagg tgtgcaatag cgagtggcca gtccgatcag ccacacagca 1740 

gcaccgatgg gcctgcccgc cctttccaaa cttgaggctc tgtccaccaa atgcacgctg 1800 

ataaatcttc ccatcttcag ttctgctaaa cggcatgcca taattttcta gctcgaccac 1860 

ggcggcgggg gcctgctccg tcatgtagtg gatggcatcc tggtccccca gccagtcgga 1920 

gcccttcacg gtgtcgtaga aatgccacct ccagttgtcc tcctccatgt tccccagagc 1980 

agcattgatt cctccctgcg ctgcaacagt gtgtgacctg gtaggaaaca gcttggtaac 2040 

acatgctgta ttaaaccctg cctcagaaag gccaaatgca gctcgcaagc ctgcccctcc 2100 
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agcgcctacc accactgcat caaattcatg atccactact ggatactgag cagaaatgga 2160 

atctgaaact ttagcagatg ccctcttgtt cccatcaaca gtgaagtgaa aacctcgggt 2220 

tcctgtttgc aacactgttg gccacgcctt ggccagcgcc aggcgccgag cgctcagcag 2280 

ccgcgacagg ccccggaccc ccgacatgtc tgctgttgcc gccgcgcagt c 2331 



<210> 172 
<211> 416 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (1) . . . (416) 
<223> n = a,t,c or g 



<400> 172 

gttagagatt acaatttgac tgaagaacag aaggcgatca aggccaagta tccgccagtc 60 

aataggaagt acgagtattt ggatcataca gcagatgtcc atggattgta ttacacagag 12 0 

catgaatata cttctttagg ttacacgcat ggggagatac tctggaggaa gcatttgagc 18 0 

aatgtgcaat ggccatgttt ggttacatga cagatactgg gacagtggag cccctccaaa 240 

cagtagaagt agaaacccaa ggtggggaga agaattttca ttgtccaagc accctcaggg 300 

aacagaagtc aaagcaataa catattcagc aatgcaggtc tataatgaag agaacccgga 36 0 

agtttttgtg atcattgaca tttaagacac ccaaaaatan aagactccta cgaaaa 416 



<210> 173 

<211> 1737 

<212> DNA 

<213> Homo sapiens 



<400> 173 

tcaatataag cctgtgtagt cccgtaggcc tctgggaata attagctcaa cagtggactg 60 

tgaaacacac actgcttcct tcctggtgag ttctgctctg agctccaatt gcagcccagt 120 

tgtcacacac tgaaaagcaa aacgcagttc agctggtttt aaaagaccca ttgacagccc 18 0 

agggcttgta aaacgatgtg tacgggtagc ttcagctggc cccttgcctt ccccagagcc 240 

cgcttcggac atggtgtggg ttgcatcctt ctctgcaaac tgcctgtgtg ctctgtggca 300 

cggcttccgg caagagattt cctctgcagc attaatgtct cgtggaaggc caagtctctt 360 

gctttgtgat cccattccac tcttccacca tgattgtgag gcctccccag ccatgtggaa 42 0 

ctgtaagtcc attaaatctc tttcttttgc aaatttccca gtctcaggta tgtctttatc 480 

agcatcgtga aaacagactg atacagtaaa ttggtaccag tagagtgggg tgctgctgaa 540 

aagataccct gcaaagccac aggggtggag ctgcccaaga ccatgggaac ctacctcttg 600 

catcagcatg acctggatgt gagacatgga gtcaaaggag atcattatgg aactcaatgg 660 

agacactgca ataggtcact tcttcctgtc caaggaaaaa gcggcgcttt ggaggggata 720 

ggccctcctc gagtttttct cccccgagta aagagctctt ggcggtcaaa gcgcctcgcg 780 

aggggcggag gggcccgggg aacgagtccc ggagcgaacc cagccagcct ctggattccc 840 

acggcccagg gctccgccgg acattcctac cgccctcccc tcgacacccc actaaagaca 900 

gaagaaccgc ggctcgatcg ggtccgcggc ggaagcgcgg ccaaacgaat gaaattcgag 960 

ggtgtaaaga agaggaaggg gaaaaatatc ttgtaccagc ccaggggtga agaagccccc 102 0 

ggcctgagaa agaaggagga gtgggggagg cgaacagtct cgttgctgcc tctgtgtacg 1080 

ctgagggggg agggctcttc ttttcgacta ccctcgctga agattttctc ttccttccct 1140 

cgcccaactc tccctcgggg cctctacgga gacccccccc ccccatgggt ggcccaaaat 1200 

ggagaacgcg gagcagcgca agatggaggc tccgcttctc tgtgtagcag tagatttagg 1260 

ttccttttct ccggcaggct cctaagccta ggagagcccg aactggagag agggaacaac 1320 

attcatgtga cgggcccctc tgcttctccc ccgccccatc atctccagcg cgtggtgggg 1380 

gaactgctgg ggaaggcgat tggcatcgat ctctccatcc cttccgaggc ccgacttcgg 1440 

aattgtctta gaagaaagaa ggcaagccac cattttaccc acgtaaatat atgaatatat 1500 

ttctgacatt gaggtgttcc agaagatgat aaagaaatga tagcagctcc agaaatacca 1560 

actgatttta atctactaca ggagtcagaa acacattttt cttctgacac agattttgaa 162 0 

cgatatcgca ggaaaaaacc gaactcaggg caaaggcaat acttgtagaa aaggcgaaga 1680 
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gggcccagca gaaaaggtct gaggtggaaa aggaagagga acacctactt ctgcgcg 1737 



<210> 174 

<211> 6464 

<212> DNA 

<213> Homo sapiens 



<400> 174 

ctagttgtgg tgacacgtgt aagatgagca agaacactag attgagctaa tgctgactct 60 

tgtctttgta gaatgaccgc cagccttttt tcttttcttt ggaacactat tctgcaggtg 120 

catcagctgc tttttctttt ttctctttgc agttaaaaat tcaagatgtc catttacgct 180 

ttgaagatgg tgtcaccaat ctgttgaatg aatgacattc acgaataaag acagtcatcc 240 

tattttcttc tttgtctttt ttacctcagg tacagaaact aatgcggaaa aagcaattag 300 

acgtagcatt gtaaaagagt gactaccaga aggaaagcca ttcggagcag agcaggtgtt 360 

cttgtgacac acaggaggcc atggccagga gcatggagag tcgcagccat cactacgtcc 420 

tggagcctgt gtttgcatct gctcttttga agagaaactg ctccaagaag cccctgcata 480 

ggtcttaact gagtaaaaga aacaaaccta ttctaaccta gtatctctta acctctaata 540 

caaagctgca ataccggcaa atcatggaat tcctcaagct caggagagga gtggggttat 600 

gtggattgtc ctggattggt agccttttcg ttgcttttgt tgtaagaagt gattacatcg 660 

ttatggacta tttcagctgc cgagaatggt ggtattttgc tttgaatgct aacttgtatg 720 

agatcagaga gcagaggaaa cgttgcacct gggactttat gttgcaccgc gctcgtggtg 780 

tccagtgtct ctggatctac aacgtttgtg tttttgttta tttgcttttg ttttcaccca 840 

aacaggagga aatgtgtcgg attgaagagg aacagagcta acagcagcat atcacagctt 900 

gcttttatct ttgagtactc atctctgctc ctttgtgatt gacacagagt ctgcgggagc 960 

ctcagtttga ttctccagga gcctgtccgg gagccccaga acccggtgga ggcagtggga 102 0 

tgctgcagta tctccagtcc tggtttcctg gatggggtgg ctggtacggg cagcagaccc 108 0 

cagaagaaca tcatcataac tgtgatcagt ctgcttgctg gtgctttttt attcttccac 114 0 

gttttttctt acagtctttt ctcccttacc atttaggcac tgaggagttt tttgacccca 1200 

ctgcagatgc ctcgtgtatg aacacgtata caaagcgaga tcatgtcttt gccaaactga 1260 

atttgcagtt gcagcgaggt acagtgacca caggtctgtt aataaggcta tagttfcttgc 132 0 

cttgttccgt gttcttattc ctgttctttc ctttagctgc agatccagat ggccccgttt 13 8 0 

ttgagatgct gtatgagaga aatccggcgc acagccactt tgagaggcgg ctcaatgtca 144 0 

gcacaaggcc cttgaacatg atgtagcact tttgttcacc cagccgtgtg aaaattctga 1500 

ctctgctgtg tatttccaac cttgcaggtt ttggttatca gtctgaactt gagctgagag 1560 

tggctgaagc tgcccgaaga caatataaca gcttgagcac catggataaa gctgttgggt 162 0 

ttcggcagtt atggttaatg actgttttat ttgtacagga ggagagtaaa cgatggaccg 1680 

tgcggctgga tatttctgcc cctcaggtga tatttcctga tgatattaca ttaagggaaa 174 0 

gaggagggaa atagcagaca tatatttgaa tcttcttttt tttatcttca gataactcca 1800 

ggaggaaaag tagggatggg tcagcatctg aagagaccca gtttagtgat gatgaatata 1860 

agacccccct ggccacacct cctaacaccc cacctcccga gtcaagcagc agcaacggag 192 0 

agaaaacacc tcccttttct ggagttgagt tcagtgaaga acagcttcaa gcacatttaa 198 0 

tgagcacaaa gatgtatgag aggtactcgc tgtcatttat ggacctccag atcatggttg 2040 

gacgagtgaa agacaattgg aagcatgtcc aggatattga cgtgggacca acacatgtgg 2100 

tagagaagtt caacgttcac ctacagttag agcgtcgatt gatttatact tcagatccca 2160 

aatatccagg agccgtgctc tcaggcaact taccagactt aaaaatccac attaatgaag 222 0 

ataaaatatc tgcactaaag aattgctttg ctctcctcac caccccagaa atgaaaactt 2280 

ctgacactca gattaaagaa aagatttttc cccaggagga gcagcgggga agtttgcaag 234 0 

actccgtaat gaatttaacc cagagcattg tgttgttgga gcagcatacc cgcgaggttc 2400 

tggtggagtc gcagctcctc ctggcggaat ttaaagtgaa ctgtatgcag cttggtgttg 2460 

agagcaatgg ccggtacatt tctgtgctca aggtgtttgg taccaatgct cactttgtga 2520 

agaggcctta tgatgctgaa gtctccctaa ctgttcatgg tttgctcctg gtggatacca 2580 

tgcagacata tggtgctgat tttgaccttt tgatggcttc acataagaac ttgagctttg 2640 

atattccaac gggaagcctt cgggatagca gggcccagtc tcctgtctct ggaccgaatg 2700 

tggcccactt aactgatgga gctacactga acgaccgatc agctactagt gtttcacttg 2760 

acaaaattct taccaaagag caagagtccc ttattaagtt ggaatatcag tttgtgagtt 282 0 

cagagtgccc atcgatgaat ttagacagta ctcttcaggt gatttcccta caggtgaata 2880 

atttagatat tatcctcaat ccagagacga ttgtggagct aattggtttt cttcaaaaat 2940 

cctttcccaa ggaaaaagat gatttaagtc ctcaaccttt aatgactgat tttgaaagaa 3000 

gcttcagaga acaaggaact taccagtcta catatgaaca aaacactgag gttgcagtgg 3060 

aaatccatag gctgaactta ctgcttcttc ggacagtggg catggcaaat agagagaaat 312 0 

atggcagaaa aattgcaact gcaagtatag gtggcaccaa agttaatgtc tcaatgggta 3180 
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gcacgtttga catgaatggt tctcttggct gtttacagct tatggatttg acacaagata 3240 

acgttaaaaa ccagtatgtt gtcagcattg ggaattctgt aggctatgaa aatatcatca 3300 

gtgatattgg ctactttgaa tctgtgtttg tcagaatgga agatgcagcc ctcactgaag 3360 

ctttgagttt cacgtttgtt gagagatcta aacaggagtg ttttctcaac ctgaagatgg 3420 

cttctttaca ttataaccac tctgctaagt ttttgaagga gttgacgtta tccatggatg 3480 

aactggaaga aaattttcga ggtatgctga aaagcgcagc caccaaagtc accacagtac 3540 

tagctaccaa gactgccgag tatagcgaga tggtatcgct ctttgaaact ccaaggaaga 3600 

ctcgggaacc ctttatctta gaggaaaatg aaatatatgg gtttgaccta gcttcgtctc 3660 

atttggacac tgtaaagcta atcttgaaca taaacattga atcaccagtt gtttctatcc 3720 

ctcggaagcc ggggagtcct gagttgttgg tgggacactt gggacagata ttcatccaga 3780 

attttgtggc gggagatgat gaatccagaa gtgaccgtct gcaggtggaa atcaaggaca 3840 

ttaaactgta ttctttgaat tgcacccagt tggcaggtag agaagctgtt gggtctgaag 3900 

gaagccggat gttttgccca ccttccgggt ctggcagcgt ttaatcagca gtaatgaagt 3960 

aaatattaag gtttatggtc tatcatttca tctcttttat gccagctttt cacatcctga 4020 

acaacaccac cattcagttt aaactggaga agatccctat agagagagaa tctgaattga 4080 

ctttttctca tttatgggca ttacatctac gtgcatttaa taataagagt aactgaattt 4140 

ttctcttttc aatacaggtg gtgttagcaa agcatgtata tgagcaggtt ttacaaaccc 4200 

tggacaatct cgtgtacagt gaagatctga ataagtatcc agccagtgct acctccfcccc 4260 

cttgccctga ttctcctctg cctcccctca gtacctgtgg agaatcttct gttgaaagga 4320 

aggagaatgg attgttcagc cactccagcc tttctaacac ctctcagaag tcattgtcag 4380 

tgaaggaagt caaatccttt actcagattc aagccacctt ttgtatatca gagcttcagg 4440 

ttcagctaag tggagatctg actttggggg cccaaggtct tgtgagctta aagtttcagg 4500 

actttgaggt ggaattcagt aaagaccatc cccagacttt atctattcag attgccctgc 4560 

attctctgct gatggaggac ttattggaga agaatccaga ttctaaatat aagaacctga 4620 

tggtgtctcg aggagcccct aagccatcta gtttagcaca aaaagaatac ctttctcagt 468 0 

cttgcccctc agtgtccaat gtggaatatc ctgatatgcc tcggtctctc ccttcccaca 4740 

tggaagaagc tcctaatgtc ttccagttgt atcaaaggcc cacctctgcg tcccggaaaa 4800 

agcaaaagga agtccaagac aaggactatc ccttgacccc acctccttct ccaacagtgg 4860 

atgagcccaa gatacttgtt ggaaagagta aatttgatga ttccttagtc cacatcaaca 4920 

tattcttggt agataagaaa catccagaat tctcttccag ttacaatcga gttaaccgga 498 0 

gcattgatgt tgattttaat tgcttggatg tgctgatcac actgcaaacc tgggttgtga 5040 

tattagactt ttttggaatc ggctccactg cagacaacca cgcaatgagg ctgcctcctg 5100 

agggcattct gcacaacgtg agccgttttc ctgaaagata ccttttctcc ccagtaatat 5160 

tggccttgct ttatcttgtc tgttcctagg ttcattcact ttctctagtg ctccagacat 5220 

atttaagttg tgtgacaaaa aaggatacat agtaatgtgt tctaacttta tcatccctag 5280 

agggagacct ggccttacag ggcagcattg ggagtctgtc tctaagtgac ctcacatgct 5340 

gagttttctt gaaatactaa cttgtgtctt gctgttatct gagtcacact tttatcctaa 5400 

tttttagata cggacggcct gaccctctgc tccggagaga acacgacatt cgcgtgagcc 5460 

tccggatggc ctctgtgcag tatgtgcata ctcagcgttt ccaggcagag gtggtggcct 5520 

c.tgaggttat ttggtggtaa aattagggaa ttaattatcc tctgagtcag ttttccttcc 5580 

tctttcaggt gagagatcaa gcccagcgct gttcacgggt tctcctggat attgaggctg 5640 

gtgctcccgt tctcttgatc ccagaaagtt ccagatcaaa taatctgatt gtagcaaatt 5700 

tggagagtcc caccaccagt gctgccccct tcccatttcc tgcattctag ttttattttg 5760 

ttccttccca ggaatctgtg ccttcagctt ccccaacggg tattcccaaa cacagtctga 5820 

ggaaaacgac aagcacggag gagcccaggg gaacccattc ccaggggcag ttcacgatgc 5880 

ctcttgctgg aatgagccta ggaagcctga agagtgagtt tgtgcctttg tttagtgcaa 5940 

cccctttttg ggtccccttt tcttcattac ctctctttcc ttgggtcctt gtagaagacc 6000 

atgtctgcct gctggattgc gttgtcgtgg atctccagga ccatggacca tctttgctgc 6060 

ggagagacat ccggagagat tactcgaagg gcaccagagg atagtagtgg agatctgatc 6120 

ttcccttcct attttgtgcg gacagacagg aggaagcctc cttaaccgag ccttgtaggc 6180 

tgaaattgca ggtggaaagg aatttggaca aagaaataag tcatactgtg ccagacatat 6240 

ctatccatgg caatctctcc tcagtccact gctctctgga tctgtataaa tacaagctga 6300 

tccgcggctt attagagaac aacctgggag aacccataga ggaatttatg cggccttatg 6360 

atttacaaag atccaagaat tcatactgtc ctgagtggag aagtgtacac ctgtatgtgc 6420 

ttcctcattg atatggtgaa tgtaagtctg gagcttaaag atcc 6464 



<210> 175 

<211> 1186 

<212> DNA 

<213> Homo sapiens 



97 



WO 01/74836 



PCT/US01/10472 



<400> 175 

tttcgtgcta ccagaagaag tgtcgggtac gtctgcatta cacctggcgg gagctctggt 60 

cagccttgat aaatttgctg aagttcctta tgtcaaatga gactgtactt ttggccaaac 120 

acaacatttt tacattagcc cttatgattg tgaacctatt taatatgttt atcacatatg 180 

gcgacacatt tctgccaacc cccagcagct atgatgaact ttactatgag attatccgca 240 

tgcaccagag ctttgacaac ctctactcca tggtcctgag gctttctacc aatgcaggcc 300 

agtggaagga agcagctagc aaggtgaccc atgcattggt taatatcaga gccatcatca 360 

accactttaa ccccaaaatt gagtcctacg ctgctgtgaa tcacatatcc caactgtcag 420 

aggagcaggt gctggaggtg gtgagagcca actatgacac gctcacgctg aagctgcagg 480 

atggcctgga ccagtatgag cgctactcag agcagcacaa ggaagctgcc ttcttcaaag 540 

agctggttcg atccattagc accaacgtcc ggagaaacct ggccttccac acactcagcc 6 00 

aagaagtcct gctcaaggag ttctccacta tctcctgagg ccacgcctac ctgagcagcc 660 

tctgactgcc cttacccatg aggatcatgg gctggagggg gagcgaaggg agagggggct 720 

gcccccgagg ttggagagaa acacagatac caagttctct tggtgaagac cgcacttcaa 780 

tggagcttgc gcagcagggg caggagggtc aagccaggga taatcttatt tggggaggaa 840 

tgggtgggca cagtggggag aggcctccag gaatgtgggg acttccagag tgggctccct 900 

aaacagcccc tcacatttcc aatcttgagg ttacaagctg tagctactga ctaattttcg 960 

ggagctggtg aggcaagcct cagggtaatg ccagcaccct gaaagtcagt ggctctgctc 1020 

ggcttctgaa aacagatcaa ctctcagata tagctggagc tacatcctgc ttcctttggg 1080 

cctggggctg tgcttggctt gaaatctgtg tcccctgcct gcactggcac tttgtccagt 1140 

catcgggacc tttggctcag atttagggta agataggaat agggta 1186 



<210> 176 

<211> 2477 

<212> DNA 

<213> Homo sapiens 



<400> 176 

gcggctttgg tggaccacag tacaagccta tgaccccagg gcagctctcc aacgtgcggg 60 

cgccggggtc ggctgagaag ggcagcgggg acacggggga tgcccggccg ccctcggccg 120 

cgccgcctgg ggggagcgct ggcgaggcac ggacggcggg cgcccggtac ctctgcccgc 180 

ggtcctcgct ctcgggcggg gcggcggcga cgcggacctg cggactagcg aacccggagg 240 

a gg aa gg a cc ctctgctaaa tgtggcgaga atggtagtgc agagaggact gaccttgggg 300 

gaaacaagta caaccaggag aggatacaaa tagagtatgt ggaggtgctg tttgcagatt 360 

tctttagaga agtttttgcg atatgtggaa gctgtgatgc tttgggaaac tggaatcctc 420 

aaaatgctgt ggctcttctt ccagagaatg acacaggtga aagcatgcta tggaaagcaa 480 

ccattgtact cagtagagga gtatcagttc agtatcgcta cttcaaaggg tactttttag 540 

aaccaaagga aaacattcat catcgcggtg acttccttgt aacctttcca agttcttcac 600 

gttcctcatt tgttcagact ggacagttct ctgggaggga catagacaag gacccaaaac 660 

taagtcctgt gggcaggggc tggggttttg aatgggctat agagctatgc atggcagtca 720 

aagaggatgt tcgacaggaa gtaggaagcc atatagggtt acttcctgat gttgccatgg 780 

catttgtaaa ctgtcgtggc actgatggga gtgtagcagt gaggatgacc agaggtcact 840 

ctcattgcca tcttggtttt gcctactgtg ctagtgggtt ctccttggag ccttgtgttg 900 

aaaatgactg tggggcctct tctgcagagg tgcaacaggg cttcgtgttc atcacatctg 960 

cctcctcctc ttcctcatat tgcactgagg ccaaaagggt gaagctgaca ctagaaggcc 1020 

tggaggaaga tgacgatgat agggtatctc ccactgtact ccacaaaatg tccaatagct 1080 

tggagatatc cttaataagc gacaatgagt tcaagtgcag gcattcacag ccggagtgtg 1140 

gttatggctt gcagcctgat cgttggacag agtacagcat acagacgatg gaaccagata 1200 

acctggaact aatctttgat tttttcgaag aagatctcag tgagcacgta gttcagggtg 1260 

atgcccttcc tggacatgtg ggtacagctt gtctcttatc atccaccatt gctgagagtg 132 0 

gaaagagtgc tggaattctt actcttccca tcatgagcag aaattcccgg aaaacaatag 138 0 

gcaaagtgag agttgactat ataattatta agccattacc aggatacagt tgtgacatga 1440 

aatcttcatt ttccaagtat tggaagccaa gaataccatt ggatgttggc catcgaggtg 1500 

caggaaactc tacaacaact gcccagctgg ctaaagttca agaaaatact attgcttctt 1560 

taagaaatgc tgctagtcat ggtgcagcct ttgtagaatt tgacgtacac ctttcaaagg 1620 

actttgtgcc cgtggtatat catgatctta cctgttgttt gactatgaaa aagaaatttg 1680 

atgctgatcc agttgaatta tttgaaattc cagtaaaaga attaacattt gaccaactcc 1740 

agttgttaaa gctcactcat gtgactgcac tgaaatctaa ggatcggaaa gaatctgtgg 1800 

ttcaggagga aaattccttt tcagaaaatc agccatttcc ttctcttaag atggatggaa 1860 

tgtgggatgg taacttatca acatattttg acatgaatct gtttttggat ataattttaa 1920 

aaactgtttt agaaaattct gggaagagga gaatagtgtt ttcttcattt gatgcagata 1980 
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tttgcacaat ggttcggcaa aagcagaaca aatatccgat actattttta actcaaggaa 2040 

aatctgagat ttatcctgaa ctcatggacc tcagatctcg gacaaccccc attgcaatga 2100 

gctttgcaca gtttgaaaat ctactgggga taaatgtaca tactgaagac ttgctcagaa 2160 

acccatccta tattcaagag gcaaaagcta agggactagt catattctgc tggggtgatg 2220 

ataccaatga tcctgaaaac agaaggaaat tgaaggaact tggagttaat ggtctaattt 2280 

atgataggat atatgattgg atgcctgaac aaccaaatat attccaagtg gagcaattgg 2340 

aacgcctgaa gcaggaattg ccagagctta agagctgttt gtgtcccact gttagccgct 2400 

ttgttccctc atctttgtgt ggggagtctg atatccatgt ggatgccaac ggcattgata 2460 

acgtggagaa tgcttag 2477 



<210> 177 
<211> 3805 
<212> DNA 

<213> Homo sapiens 



<400> 177 

atggagccca ttgagggcaa gagatccagc tgccataaaa caggagaagc tacagcagtt 60 

gtccactgtc cacccggctg gaacatcacc atgggggtgg aagcctcatg tgccttcgtc 120 

ggccgcgccg gctcccagga tacagtgcgg accggccggg cgctaaaggc gttgacgcaa 180 

ctgcgggcgg cgcagggcag gggcagtcag ggagcagcgg cggcagagac agggctgggc 240 

gggcgccgcc tgcggagagc accgggaggc gggccctgcg tggggccgcg cgcagcagcg 3 00 

gcgacgactt tatctgggcc gcgggggaca gcgcagggcc atggaggagg cggccgcagc 360 

tccggtaagg gcgaccagcg cgcgcacgaa ctcgcggcct ggattccccg ggcgacccgg 420 

gcccggcaca cgggggctgc gggagccgag ccgtactacc gcgcctgggg ctccggggaa 48 0 

cagggccgcg gcgtctgccg gggcctactg cgcttgccag ccgggccccc cactccgggc 540 

cgagcccgag cgctggcgga acggttgtcc ccgccccggg ccgccccacg ccaagattct 600 

tggcccctcc ggggcttcct gcccccgccc cagccgctga accctacatc tgcgtccccc 66 0 

cacccgaggc tcttttctct gcttggcgct cggcctattt ctccgtggac gatggcagcc 72 0 

acgattcagg ccatggagag gaagattgaa tcgcaggctg ctcacctgct ttccctagaa 78 0 

ggtcaaaccg ggatggccga gaagaagctg gctgattgcg agaagacagc cgtggagttc 84 0 

gggaaccagc tggagggcaa gtgggccgtg ctgggaaccc tgctgcagga gtacgggctg 900 

ctgcagaggc ggctggagaa cgtggagaac ctgctgcaca acaggaactt ctggatcctg 96 0 

cggctgcccc caggcagcaa gggggagtcc cctaaggtag ccctggggcg ccctggggtt 102 0 

ggggaggccg cggccaagcc tgtgagcgtg tggttctctg agcaagtgtg gggcaagctg 108 0 

gaggactggc agaaggagct ctgcaagcac gtgatgaggg gcaactgcga gatgctggtc 1140 

tctttggact acgccatctc caagtocgag gtcctctccc agattgaaca agggaaggag 12 00 

ccctgcaact ggcgccgccc tggccccaag attccagatg ttcctgtgga ccccagtcca 126 0 

gccccagtgc ccttgccctt gttttgttca ttgtatccac ctggtgaaat ccatcagtgt 132 0 

tctgtacctg cagccaaaca gctgcacgtg gtgcagagaa ccagtcctgt gactgctaaa 13 8 0 

ctgtcgactc tccagccaaa gccccacttc cacctggtgc tgcatcccac cccatgccag 1440 

cttctcaagg gcaacacagt caatcctacc ttgacctcta cccctacagc cacggcctgc 1500 

ttctctgctc cgcttcgtgg cagagcccct tggatctata ccatggaagg gaacagatta 1560 

aaccagtgtt tccaaactgg gtgctggagg gctcctgggc acatccaggc cggggaagaa 162 0 

gcccccggga gccgtgtggt gttcacacgc atcacaggtt caggagaatg caggaggggg 168 0 

cctgagaaga gctgcggatt tggtcactcc cgggaagcgt taggagagga gtggatgatt 1740 

cggaaggtga aggtggagga cgaagatcag gaggcagaag aggaggtcga atggccccag 18 oo 

catctatcgt tacttcccca gcccctttcc cgcgcctgac ctggggcatc tggctgccgc 186 0 

gtacaaactg gagccagggg ccccgggggc actgagtggg ctcgcgctgt ctgggtgggg 192 0 

tccgatgccg gagaagccct acggctgcgg ggagtgtgag cggcgcttcc gggaccagct 198 0 

gacgttgcga ctgcaccagc ggctgcaccg gggcgagggc ccctgcgcct gcccggactg 2040 

cggccgcagc ttcacgcagc gcgcccacat gctactgcat cagcgcagcc accgcggcga 2100 

gcggcctttc ccgtgctccg agtgcgacaa gcgcttcagc aagaaggccc atctgacccg 2160 

ccacctgcgc acgcacacgg gcgagcggcc ctacccgtgc gcggagtgcg gcaagcgctt 2220 

cagccagaag atccacctgg gctcgcacca aaagacccac accggcgagc ggcccttccc 2280 

ctgcacggaa tgcgagaagc gctttcgcaa gaagacgcac ttgattcggc accagcgcat 2340 

ccatacgggc gagaggccct accagtgcgc acagtgcgca cgcagcttca cgcacaagca 2400 

gcacttggtg cggcaccaaa gggtgcacca gacggccggc ccggccaggc cctctcccga 2460 

ctcgtccgct tctcctcatt ccactgcccc gfcccccgacc ccatcctttc ccgggccaaa 2520 

gcctttcgcg tgctccgact gcggcttgag cttcggctgg aaaaagaacc tcgccacgca 2580 

ccagtgtctg caccgcagca gagggtcgcc cctttgggtg cgatgagtgc gcactgggcg 2640 

ccaccgtgga tgcccccgcc gccaagcccc tggccagcgc gcctggcgga ccgggctgcg 2700 
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gcccaggatc cgatcccgtg gtgccccagc gcgccccctc gggcgagcgg tccttcttct 2760 

gcccggactg cgggcgcggc ttctcccatg ggcagcacct ggcgcggcac ccgcgcgtgc 282 0 

acacgggcga acggcccttc gcctgcacgc agtgtgaccg ccgcttcggc tcgcggccta 2880 

atctggtcgc ccactccagg gcccacagcg gcgccaggcc ttttcgcctg cgctcagtgc 2940 

ggccgccgct tcagccgcaa gtcgcacctg ggtccgccac caggcggtgc acactggcag 3000 

ccgcccccac gcctgcgccg tctgcgcccg cagcagcttc agctccaaaa ccaacctagt 3060 

ccgccaccag ggcattccac acaggctccc gccccttctc ctgcccgcag tgcggaaaga 3120 

gcttcagccg caagacccac ctggtgcggc accagctcat tcacggcgaa gccgcccacg 318 0 

cggccgccgg acgccgccct tgcggcccca gcctggtccg ctccccccga ggtggcgccg 3240 

ccccccgctc ttcttctgag cctagttctc acgaggaccc tttcttgccc cacagtttcg 3300 

aaaaggcccg tgcccatgaa gaccgccctg gggtgaagca aggcgaccct ggctgctgcc 3360 

cgaagggttt ggccgccgcg ggacacctgt ttccttcccg cagtgtctgc gtccgcacag 3420 

catacccagc tcggacctcc taggacagag actcagcgaa cccttgctgg gaaccgctga 3480 

gctgaagttc ttggaaggct cccacccagg tgccccgttg gaaagcagat atttcccgga 3540 

cccagcgcgg cctcaaccag gggcaggaaa gagtggttat ttatgtactt aaagtttcat 3600 

taaagttaaa atcggaaagt tctggggctg ctaaatgaat tgggggaggg aacacctgac 3660 

tctccttagc cactgtccca cctccatcca cacacagacc acccccctcc acttcccctt 3720 

ctgtcctggg tgagttacat ttagccagct gctgttaatt ggttctccac aaataaaaat 378 0 

gtaatatatt tattcactaa aaaaa 3805 



<210> 178 

<211> 4485 

<212> DNA 

<213> Homo sapiens 



<400> 178 

ccatactgca gtgtcgtgga attcggacag gcagagtgct gatgtgaaaa ataccacgac 60 

aaaaccaccc tagttctcca aaatacttat cattcaggtt gaataaacca gttctacaga 120 

ctgcttcttc catgcatttc ccaatgatgc taactgccct taaaattctt tcagatttgg 180 

aactgtatct gtatggaaac aacagagttt tcttgaagta caactaagga caggtcacca 240 

ctaggcacta acatcgctga cttgcatgat tatggagatg gtctatctga tgctgaaaat 300 

gtctctagtt ttttgacaac ggctaaataa ccatgggatc aagtggcctt gggaaagcag 360 

caacattaga tgaactgctg tgcacttgca ttgagatgtt tgatgacaat ggagagctgg 420 

ataatagtta tttgccaaga atagttctac tgatgcaccg atggtattta tcttccactg 480 

aattggcaga aaaacttctc tgcatgtatc gaaatgccac tggagaaagc tgcaatgaat 540 

ttcgattaaa gatctgctac ttcatgaggt actggattct gaagtttcct gcagagttta 600 

atttggatct tggtttgatt cgtatgactg aggaatttcg ggaagtagct agtcaactag 660 

gatatgaaaa acacgtcagc ctcatcgaca tatccagcat tccttcctat gactggatga 720 

gaagagtcac acagaggaaa aaagtatcca agaagggaaa agcctgtctg ctgtttgacc 780 

atctggagcc cattgaattg gctgagcacc tcacttttct ggagcataaa tcttttagaa 840 

ggatctcatt cactgattac caaagctatg tcatccatgg ctgcctggag aataatccaa 900 

ccttggaaag atcgattgct ttatttaatg gaatctctaa gtgggtccag ttgatggttc 960 

ttagcaaacc aaccccccag caaagggcag aagtcatcac aaagtttatc aatgttgcaa 1020 

agaagctcct tcagctcaaa aattttaaca accttattgc aattgtggga gccctcaagt 1080 

cataggtcca tttcaggctt caaagggacc cattcctcat ctttcctcag aagtttacaa 1140 

agaactggaa tgtgaaatga cagaagtggg tctcctccaa cgggcaatta ctgcaattac 1200 

cgcaagccct ttgccgactg cgatggcttc aaaatcccca tccttggagt acacttgaaa 126 0 

gacttgatag ctgtccatgt cattttccca gactggacag aggagaacaa agtgaacatt 1320 

tgtgaaaatg caccagctct ccgttaccct gagtgaacta gtctccctgc agaatgcctc 1380 

tcaccactta gaacccaaca tggatttgat caacctgctc acgctttccc tggacctcta 1440 

tcacactgaa gatgatattt acaaactgtc actggtgctg gagcctagaa attctaaatc 1500 

gcagcctacc tcccctacga cgcccaacaa gcctgtggta cccctggagt gggcattagg 1560 

ggtgatgcca aagccagacc ccacggtcat caacaagcac ataaggaaat tagtggagtc 1620 

tgtatttaga aactatgatc acgaccatga tgggtacatt tcccaagagg actttgaaag 168 0 

tatagctgcc aattttccct tcttggattc cttctgtgtt ctggacaaag atcaggatgg 1740 

cctaattagt aaagatgaaa tgatggctta cttcctgaga gctaaatccc aactacactg 1800 

tcaaatcgcg caccaggatt tatccacaat tttcaggaga tgactctatc tcaagccaga 1860 

ccttctgcga acactgtgcg ggatttattc tgtggcataa tcaagcaagg atacaaatgc 1920 

aaagactgtg gagccaattg tcacaaacag tgcaaagacc tcctggttct ggcctgcagg 1980 

agaatttgcc cgggcgccct ccttgagcag tggtcatggg tcactgcctg gaagcccctc 2040 

gctgccccca gcgcagggat taggtgttta agttccctgg agtcactgct gacaacagcg 2100 
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attttagaca gcagagccat cacactggtt acaggctctt ctcgcaagaa cttctgtgag 2160 

gctacagagg gccaccacca gccaggccac cccagactgg aacctgtctg gtcagaggct 2220 

ggctgggggg gactcggggt cccacaccct tccctataat agatattcag gttccttgca 2280 

caagccacca aaaggcacaa gggctttgca atatgggaaa aatgaaaaag cccagggtgg 2340 

catggctggt ggggatgttt aaaacccggg caccggagtt tgaaccttgc cccggatgaa 2400 

ggagagaaga ccacacacgg atgggtgagg atgggctgac ttcaaggctg gcggaaaact 2460 

tgaaggccaa taatggttgg cttttgggag ggggcaagaa caagaaactc ttgaggaaag 2520 

ctcttgcctc tcaggaagtt atcttggaaa gaaccccctg atgttaactt gccttgggga 2580 

cactgtggga tctccatgtt tggactatgg gacagagaat tgaccctaac taaactatga 2640 

attattatgt gcctcctccc ctacccctag ttaagtgcca caaagactgt gttatgtagt 2700 

cagtactttt ttctcatgga tctttctcta gaccatttat atacgggctg aaatgaaagc 2760 

ctttttgcat gtacttgata ctcagtctgt aaactcagac ttcgcttttt ttgtgagact 2820 

atcctttcaa tatttttata aacttttgtg tgtgctgtgg cagggggaca gttaacaagg 2880 

gagtttattt agaggggttt ttttcatttt ttgtttttct tgttttgttt tgttttgttt 2940 

tgttcttctt ggtgtgcttt ttggatggga ccagaactta acattttcct gaggactagg 3000 

aagctcatca gacactggaa tgcagtgatt cttctaatga tggtcaactg caaggagaaa 3060 

ctgtttacat ccttcccact acacagttgc tatgatatgt aacaaagtca ccacaatttt 3120 

tgtatatatc acacaagtct tacccaattc tgcgtcacta gggaagctat agcatggtcg 3180 

agaaagctct cctaatactt acttacttaa atagccatgg aagaaaatta tgcttgtccc 3240 

ttctgactgg gttaagccat ctttcttaaa gattcctgaa gtagaaatag caattaagaa 33 00 

attaattcaa gttttgacgg ttaacaaaca tttagcagaa aaacaaacca ttgaataagc 3360 

tactaggtga aatgcaaaac aaccatcaat ttttttaaac tcaataattt tgtgaaaaaa 3420 

aaaaccacca ccaaactttt ggttgtaaaa gagaaccctt gttcccttcc taagaaactg 3480 

ccttccacaa ttaaagaata ctcatagaaa gaaaataatc ataaatcata tttgattaac 3540 

atttcattta agcttttaaa atatcaattt tttaaatagc tcccatttac ccaaaataac 3600 

tagttatggg tctatttctt ccatgattag aggagagaga acttgattca agatactgaa 3660 

aaatagagct gggactgagc ctgtgaatga caggaatgat ctattaccag gtgactagtc 372 0 

ggaaactaca cagaccgtgc actggtctgc atttgggggc taaagtgtat atattccatt 3780 

taaaatggaa tttgtttgta tttggggcag tcttcagtaa agcctttctc ctttcttttc 3840 

tccattgagg accagttctg ctcagccaaa attgttgagt ccctgttctg ggccaggtcc 3900 

cacgctatat gcaccaaagt taggaaaaac ttgggtctta agcccttggg gagcttgacc 3960 

agcccatggg cattaaggca aagtagttcc cagtgattta aaatacgggt tccaaatacg 4020 

ctaaaaccaa acttgtgcca acccagattt acagattggg aaatactgca gatgatgtga 4080 

agttatcagt tgaaggagct gtgatttagg ctggataata agagaacgtg cccatctgta 4140 

aagcacctca gaaggcagcc atcccctaga tgttggtttc gatgtatatt acactatcta 4200 

ctactatcca taaatgcaat aatatgcatg ttaacaacat taaaaacagc aaacagcaat 4260 

ctaagtacag aaaagctttt tgtgtgttta aaaaaattga agaaaattca ggaagaaacg 432 0 

tgttaataaa cattgtactg ttcttttgct tctcaaagga attattcact tgccactttg 4380 

gttatttttg agttttcgta cataggaagt tttatattgc cagccttcct gtgataaaga 4440 

tattaaatgt aacttgagta aaaaataaaa aagaataact atgta 4485 



<210> 179 

<211> 4864 

<212> DNA 

<213> Homo sapiens 



<400> 179 

acgcagctcc gccccgcgtc cgacccgcgg atcccgcggc gtccggcccg ggtggtctgg 60 

atcgcggagg gaatgccccg gagggcggag aactgggacg aggccgaggt aggcgcggag 120 

gaggcaggcg tcgaagagta cggccctgaa gaagacggcg gggaggagtc gggcgccgag 180 

gagtccggcc cggaagagtc cggcccggag gaactgggcg ccgaggagga gatggaggcc 240 

gggcggccgc ggcccgtgct gcgctcggtg aactcgcgcg agccctccca ggtcatcttc 300 

tgcaatcgca gtccgcgcgt cgtgctgccc gtatggctca acttcgacgg cgagccgcag 360 

ccctacccaa cgctgccgcc tggcacgggc cgccgcatcc acagctaccg aggtcacctt 420 

tggctcttca gagatgcagg gacacacgat gggcttctgg ttaaccaaac tgaattattt 480 

gtgccatctc tcaatgttga cggacagcct atttttgcca atatcacact gccagtgtat 540 

actctgaaag agcgatgcct ccaggttgtc cggagcctag tcaagcctga gaattacagg 600 

agactggaca tcgtcaggtc gctctacgaa gatctggaag accacccaaa tgtgcagaaa 660 

gacctggagc ggctgacaca ggagcgcatt gcacatcaac ggatgggaga ttgaagattt 72 0 

ctgttgaaac ttacactgtt tcatctcagc ttttgatggt actgatgagt cttgatctag 780 

atacaggact ggttccttcc ttagtttcaa agtgtctcat tctcagagta aaataggcac 840 



101 



WO 01/74836 



PCT/US01/10472 



cattgcttaa aagaaagtta actgacttca ctaggcattg tgatgtttag gggcaaacat 900 
cacaaaatgt aatttaatgc ctgcccatta gagaagtatt tatcaggaga aggtggtggc 960 

atttttgctt cctagtaagt caggacagct tgtatgtaag gaggtttata taagtaattc 1020 

agtgggaatt gcagcatatc gtttaatttt aagaaggcat tggcatctgc ttttaatgga 1080 

tgtataatac atccattcta catccgtagc ggttggtgac ttgtctgcct cctgctttgg 1140 

gaagactgag gcatccgtga ggcagggaca agtctttctc ctctttgaga ccccagtgcc 1200 

tgcacatcat gagccttcag tcagggtttg tcagaggaac aaaccagggg acactttgtt 1260 

agaaagtgct tagaggttct gcctctattt ttgttggggg gtgggagagg ggaccttaaa 1320 

atgtgtacag tgaacaaatg tcttaaaggg aatcattttt gtaggaagca ttttttataa 1380 

ttttctaagt cgtgcacttt ctcggtccac tcttgttgaa gtgctgtttt attactgttt 1440 

ctaaactagg attgacattc tacagttgtg ataatagcat ttttgtaact tgccatccgc 1500 

acagaaaata cgagaaaatc tgcatgtttg attatagtat taatggacaa ataagttttt 1560 

gctaaatgtg agtatttctg ttcctttttg taaatatgtg acattcctga ttgatttggg 1620 

tttttttgtt gttgttgttt tgttttgttt tgtttttttg ggatggagmc tcactcttgt 1680 

cacccaggct ggagtgcagt ggcgccatct cggctcactg caacctctgc ctcctgagtt 1740 

cacgtaatcc tcctgagtag ctgggattac aggtgcctgc caccacgctg gccaattttt 1800 

gtacttttag tagagacagt gtttcgccat gttggccagg ctggtttcaa actcctgacc 1860 

tcaggtgatc cgcccacctc agcctcccaa aatggtggga ttacaggtgt gtgggccacc 1920 

gtgcctggct gattcagcat tttttatcag gcaggaccag gtggacttcc acctccagcc 1980 

tctggtccta ccaatggatt catggagtag cctggactgt ttcatagttt tctaaatgta 2040 

caaattctta taggctagac ttagattcat taactcaaat tcaatgcttc tatcagactc 2100 
agttttttgt aactaataga tttttttttc cacttttgtt ctactccttc cctaatagct 1 2160 

ttttaaaaaa atctccccag tagagaaaca tttggaaaag acagaaaact aaaaaggaag 2220 

aaaaaagatc cctattagat acacttctta aatacaatca cattaacatt ttgagctatt 2280 

tccttccagc ctttttaggg cagattttgg ttggttttta catagttgag attgtactgt 2340 

tcatacagtt ttataccctt tttcatttaa ctttataact taaatattgc tctatgttag 2400 

tataagcttt tcacaaacat tagtatagtc tcccttttat aattaatgtt tgtgggtatt 2460 

tcttggcatg catctttaat tccttatcct agcctttggg cacaattcct gtgctcaaaa 2520 

atgagagtga cggctggcat ggtggctccc gcctgtaatc ccagtacttt gggaagccaa 2580 

ggtaagagga ttgcttgagc ccagaacttc aagatgagcc tgggctcata gtgagaaccc 2640 

gtctatacaa aaaattttta aaaattagca tggcggcaca catctgtaat cctagctact 2 700 

tggcaggctg aggtgagaag atcattggag tttaggaatt ggaggcggca gtgagtcatg 27 60 

agtatgccgc tgcactccag cctgggggac agagcaagac cctgcctcaa aaaaaaaaaa 2820 

aaaaaaaatt caggccggga atggtggttc acgcctgtaa tcccagcact ttggggggtc 2880 

gaggtgggca gatcacctga ggtcaggagt tcgagaccag cctggccaac atggtaaaac 2940 

cccatttcta ctaaaaaata caagaattag ctgggtgtgg tggcgcatgc ctgtaatcct 3000 

agctactcag gaggctgagg caggagaatc acttgacccc aggaggcgaa gattgcagtg 3 060 

agctgatatc gcaccattgt actccagcct gtgtgacaga gcaatactct tgtcccaaaa 3120 

aaaaaaaaaa ttcaaatcag agtgaagtga atgagacact ccagttttcc ttctactccg 3180 

aattttagct cctcctttca acattcaaca aatagtcttt tttttttttt tttttttttg 3240 

gggatggagt ctccctctgt tgcccaggct ggagtgcaga ggtgcgatct ctgctcacta 33 00 

caagctctgc ctcccgagtt caagtgattc tcctggctca ccctcctgag ctgggattac 3360 

aggcgcctgc caccatgcct ggctaatttt gtgtttttag tggagacggg gtttcaccat 3420 

gttgtccagg atggtcttga tctcctgacc ttgtgatcca cccacctcag cctcccaaag 34 80 

tggtgggatt acaggtgtga gccaccgcgt ccagccagct ttattatttt ttttaagctg 3540 

tctttgtgtc aaaatgatag ttcatgctcc tcttgttaaa acctgcaggc cgagcacagt 3600 

ggctcatgcc tgtaatccca gcattttggg agaccaaggc ggatggatca cctgaggtca 3660 

ggagctcaag accagcctgg ctaacatggt gaaacctcat ctccacttaa aatacaaaaa 3720 

ttgccggccg cggcggctca tgcctgtaat cccagcactt tgggaggcct aggcgggtgg 3780 

atcacgacgt caggaaatcg agaccatcct ggctaacacg ggtgaaaccc catccctact 3840 

aaaaatacaa aaaaaaatta gccgggcgtg gtggtgagcg cctgtagtcc cagctactcg 3900 

agagcctgag gcaggagaat ggcatgaacc tggaaggtgg agcttgcagt gagctgagat 3960 

ggtgccactg cactctaacc tgggcgacag agtgagactc cgtctcaaaa aaaaaaacaa 4020 

aaaccaaaac ttatccaggt gtggcggtgg gcgcctgtga ggcaggcgaa tctcttgaac 4080 

ccgggaggcg gaggttgcag tgagccaaga tcacaccatt gcactccagc ctgggaaaca 4140 

agagtgaaat tccatctcaa aaccaaattt tcaaaaaaaa aacatgccgc ttgagtactg 42 00 

tgtttttggt gttgtccaag gaaaattaaa acctgtagca tgaataatgt ttgttttcat 4260 

ttcgaatctt gtgaatgtat taaatatatc gctcttaaga gacggtgaag ttcctatttc 4320 

aagttttttt tgttttgttt tgtttttaag ctgtttttta atacattaaa tggtgctgag 43 80 

taaaggaaat aggcagggtg tgttgtgtgg tgttttaact aggcgcttct ctctcagaga 4440 

gttttgaaac ctgtttacat aaaggcccaa gatgggaagg agatccaaac ataagccacc 4500 

agcctcattc caagtctctt ctctttccaa ccctggattt ttttttttta tttaacattg 4560 

tttcttttag ctttattttt cttataaaag aaatgtatca ctataaaaaa ttacacacta 4620 

cagaaaaata ttaagaagaa aaacattcac atcggaaaca aagttttttc ccatgaaaac 46 80 
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agaacccaaa agggtaagtg gttagtattt 
gcgaggtggc tcacgcctgt aatctcagca 
gaggtcagga gtttgagacc agcctggcca 
taaa 



caccagcaat tatgttgaga ataaggccag 4740 
ctttgggagg ccagggcagg cagatcatct 4800 
acatggtgaa accctatctc tactaaaaat 4860 

4864 



<210> 180 

<211> 6397 

<212> DNA 

<213> Homo sapiens 



<400> 180 

atgcccgcgc cgcgcgcccg ggagcagccc cgcgtgcccg gggagcgcca gccgctgctg 60 

cctcgcggtg cgcggggccc tcgacggtgg cggcgggcgg cgggcgcggc cgtgctgctg 120 

gtggagatgc tggagcgcgc cgccttcttc ggcgtcaccg ccaacctcgt gctgtacctc 180 

aacagcacca acttcaactg gaccggcgag caggcgacgc gcgccgcgct ggtattcctg 240 

ggcgcctcct acctgctggc gcccgtgggc ggctggctgg ccgacgtgta cctgggccgc 300 

taccgcgcgg tcgcgctcag cctgctgctc tacctggccg cctcgggcct gctgcccgcc 360 

accgccttcc ccgacggccg cagctccttc tgcggagaga tgcccgcgtc gccgctggga 420 

cctgcctgcc cctcggccgg ctgcccgcgc tcctcgccca gcccctactg cgcgcccgtc 480 

ctctacgcgg gcctgctgct actcggcctg gccgccagct ccgtccggag caacctcacc 540 

tccttcggtg ccgacgcagg tgatggatct cggccgcgac gccacccgcc gcttcttcaa 600 

ctggttttac tggagcatca acctgggtgc tgtgctgtcg ctgctggtgg tggcgtttat 660 

tcagcagaac atcagcttcc tgctgggcta cagcatccct gtgggctgtg tgggcctggc 720 

atttttcatc ttcctctttg ccacccccgt cttcatcacc aagcccccga tgggcagcca 780 

agtgtcctct atgcttaagc tcgctctcca aaactgctgc ccccagctgt ggcaacgaca 840 

ctcggccagg tcaaagctgt cacaaggcca gcagggaaac aatggcagtg aaagtaaact 900 

ccacctgttg gtggcaaaat ggcagcacac gctgggaagg gtggagttga cagtggctgt 960 

ctttggagat gactacacta atattgtccc ttttggaatc tcaaaggaca gtgcaaggct 1020 

cttggacaaa aagagagacc gtcaatgtgc ccgcgtgctg gccgacgaga ggtctcccca 1080 

gccaggggct tccccgcaag aggacatcgc caacttccag gtgctggtga agatcttgcc 1140 

cgtcatggtg accctggtgc cctactggat ggtctacttc cagatgcagt ccacctatgt 1200 

cctgcagggt cttcacctcc acatcccaaa cattttccca gccaacccgg ccaacatctc 1260 

tgtggccctg agagcccagg gcagcagcta cacgatcccg gaagcctggc tcctcctggc 1320 

caatgttgtg gtggtgctga ttctggtccc tctgaaggac cgcttgatcg accctttact 1380 

gctgcggtgc aagctgcttc cctctgctct gcagaagatg gcgctgggga tgttctttgg 1440 

ttttacctcc gtcattgtgg caggagtcct ggagatggag cgcttacact acatccacca 1500 

caacgagacc gtgtcccagc agattgggga ggtcctgtac aacgcggcac cactgtccat 1560 

ctggtggcag atccctcagt acctgctcat tgggatcagt gagatctttg ccagcatccc 1620 

aggcctggga gtttgcctac tcagaggccc cgcgctccat gcagggcgcc atcatggggc 1680 

atcttcttct tgcctgtcgg gggtgggctc actgttgggc tccagcttag tgggcactgc 1740 

tgttcctttg cccgggggct ggctgcactg ccccaaggac tttgggaaca tcaacaattg 1800 

ccggatggac ctctacttct tcctgctggc tggcattcag gccgtcacgg ctctcctatt 1860 

tgtctggact cgctggacgc tatgagaggg cgtcccaggg cccagcctcc cacagccgtt 1920 
tcagcaggga caggggctga acaggcccta ttccagcccc cttgcttcac tctaccggac ' 1980 

agacggcagc agtcccagct ctggtttcct ttctcggttt attctgttag aatgaaatgg 2040 

ttcccataaa ataaagggca tgagcccttc ctcacgacca tggtccatga caaggggcag 2100 

ggcagagggg gccttggatg ggagtccttg tgggggacca ggcaggggac ttggatcaac 2160 

aagcaccaga cgagtggcgg gggcaggcga gaggctcagt gggacctcca ccctcgttgc 2220 

cccagctggt ggctgaccag gtggctgtgg agggtcagga gctgccccgg ttcctctcca 2280 

tgtagtfcgcg aagctcctca gggtccttca gccccatgtc ctcacacacc cagcggatgt 2340 

cctcctcgcc tgccacaagg atggactgca cagcaggggc ccctacaggc tcctcaggtg 2400 

actgggctgg aggggctggc gcaaatgtca caaactctac tcgcttccgc cgccccccag 2460 

cctcctttcg ggccagggtg cttgaggagc tggtggtgcc cccaggaggg ccaggggcca 2520 

gggtaggggc ctcccctcct cccccactct cacaggggca gctcccctcc cccttgggcg 2580 

ggccagggga ctgccggtcc agctggcggc tcagttcctc ctggtcagtg cccagccaga 2640 

cccagttgtg gggctggggg gaggtggggt cagtggcact gtcgggaggt tctttgcgct 2700 

gatagcgcag gacgaagacc acaccattga ccaagaagat gaagatggcc acgcagaaga 2760 

ctcccagcag ggcgtacatg cccagctcta gctcagtgac atgctgaggg gcagggacca 2820 

tctcctcctc ctcttcctcc tcctcctccc tggcttcggt ctcctccttc ctggcctcct 2880 

cctctgcccg ctcaaacttg cccctcacac ctgtgttgcc cccgacactg cctgccacct 2940 

gccgtttacc acccatggtg gcttctgtgg ctggtgggct ccaagcaggg ctggatggga 3000 
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gagcaggggc tggagtggag gcagggggca gccccagcca ggcggtgcca gaggccagag 3060 

gcacacggtg gcggccccgg cggcagggct cgggcgggtg cagagccaca tgcagcggca 3120 

gcccctcggc gcctgcccca ctcaccacca ccccgagctg ggcaccctgc tcctcagctg 3180 

gcaggatggc accaggctcc tcggctgaga cggacagtcc caggtcacgg cggtcgtaga 3240 

gctcagctgg ggccacagtg tgatcagaga aggacagcca tagggagagg gccacctcct 33 00 

gctttggggc gggaagggct gactgtgccc agcacgtagc tgtgacctcc ccggggtggg 3360 

cagtgccccg gctcaagggt cagcaagatt gcccatcact ggctgcaccc tcagctccag 3420 

cactgagacc ttgtcgtccg tcacagccag cgcctgctcc cccaggatgg agtcagacag 3480 

tggggaacgc acctcaatgg aggtgacacc gggctcccgg cccaccacga cacggccacc 3540 

ctccagagag gctacacgcg agtccagcac gcgggcgtgt ggcgccacga ggtgggacac 3600 

gtctagcagc cagtcggggc caagcaggtg cgtgaggcgg cggccgccgt ccagcgggtg 3660 

ggccgcgaag ggggcgagga agcgcacacc ggcccgctgg tactgcaggt ggcagccacg 3720 

ggcgcgccgc tcggcctcat ccgacgcctc tgcagcgggt tccgcaggcc cttcagcagg 3780 

gccaggtacc ctccagccgc ggacctgctc gagggtggtg tcggtgagct cgatacgcag 3840 

cggtagcagg ggggcccaca cggtcagccg cagcgaggcg cggagccggc cgccaccaga 3300 

agtccacttc gcaccccccg ggcgccccgg ctctccttgc cagccacgaa cacggcatca 3960 

caggcctcaa gacacctgca ggacctgtgt gttggcagat tcgcagccga catgctctgt 4020 

cacctccacc aaggcccccc cgccgttcca cagtggacaa ggcgcacggg gacatgctgg 4080 

ggcactccag tcaagtggtg ctgtattcac cagctcctca gccttggcca gtgggataag 4140 

ggctctgatg tcccgctcag acaccaggat ttcccacacc gattttgtcc ttctctgctt 4200 

caggggcctg gcctgggtac tccagctgcc acgtgacggg gcgagtgacc gctacgcccc 4260 

caccagtgct attctccacc acaaagtcca cccataggaa ctcagacagt tcaaggggac 4320 

tgctggaatc tggctctgtg agcccagcac ggtggcaggt gatgagggtg gtgtggtgcc 4380 

tggagccctt gaagcggtcc agcttggcag tccagagtgt gggctgggct gggcgggcgg 4440 

ctgtcacatg cagccccttc ttcaccttga tccgcagggt caggaggctg gctgtgaagt 4500 

tgtgccgaag caggagggta gcactaaaga gctggccggg ccgcactggc atgtcaggca 4560 

cccgcagagt cacagcctcg tccagaggta cctcctggta ctgcgggggg tctgctgggc 4620 

gcagctccac accccccact gggagggcct gctccccagg gtcgttctcc tcgccggagc 4680 

cacagccccc agggccctca gctgcaggct caagcgtgta ggccagctcg gcccgtgtgg 4740 

tggaggcctg tgagaaccag tgcgagggaa gctccagctc caccacgcag gcgcccaggg 4800 

atggctggaa gcggcaggct tggtgagcag tgccggcagg gtgtgtggca tggagccggg 4860 

cacagggcag gctgccagac cctggtggcc aatcctgccc tttgaggtgg aagagaaccc 4920 

gggcgtaggg ct:ct g Ct gg a gtcacagccg cttccactga aacggcccgc acgtcccatg 4980 

ggactggccg ttggtggggc tcagtgactc gaggggggac cacctgctga gtggcaaaag 5040 

gtgggtagga ggcccggaga agtggctggg ccctgggcca gggctgtagg agcagaaagg 5100 

tctcagatcg ggagctcaga gaggagttgg caggtgggta gtggcccacc tgctgcacac 5160 

ggaagtgttc aggggcgtct aggagctcca gggctgccgg caggtagaca gggtccaggg 5220 

gagcctggcc acagtccacc cgaagcacag ggaagttgct gccctttccc gcagtgacag 5280 

gaatcctctc aaaggttctg gaaggcaagg atgcggccat ccctctcacg acgtccaggg 5340 

cgagggccac caggaggcag agccaggggc cgtagggccc ccgaggggcc gctgttgtgc 5400 

gaccggccat ccgcgcgcac atctcctcct ccacactgcg ggcccccttg gccgctcgcc 5460 

tctggcgcca gcgcagctcc tccacctggc gttccagccc tcgcaccttg gcctcgagtt 5520 

gggccccaga ggctcgggag ccagtgagcc ggctcagcag ggcgtagagg atcagcaagg 5580 

ccaggagtag cagggcccgg gtggaagggt caggcaccga cctcatcagg gccacgaagc 5640 

cggccaggaa gatgacaagc ttcaggcccc acaggatccg ccccagcaag gccaagacca 5700 

agccgaggag cagagacagc agccagtaga cgaccagggc ccctgctccc cacagcagga 5760 

aggtctggac ctggccaggg ctgagcttca ggccctgggc gaggtaatca ccagctagtc 5820 

ccaaggcatt cagcagctgt gcggcgatcc cagacagagc aaagaaggcc acagaaatgg 5880 

ctgatgagat ggcccacaac acttgggacg aagactctga caccaggtgc atggtctctg 5940 

gcccaatcca ggcatccagt gtccctcgca cagatcgacc tatctgggtc aagacatcaa 6000 

ctggggcttc tctcttctgt tggcctggtg gtgcaaagtc tcgacgggac tgggccaatg 6060 

ctgagtggag gaggataagg gccactagga ccatcagaat ggctttgaac acatgctttc 6120 

cccatggtga actgatgctg gaggctgcca tgactgggtc caggatctca atcttgcctt 6180 

tttgcttgtc ccatacccgc atgctgcctt gatagtggac atagtccctg ccctcatgga 6240 

gctgaagttc accaacaagc tcaaagtgga aggacatcca gactcctgcc aggaggagca 6300 

gggcttttgc aacgggctcc atgaagccat gagcctgagg ctcggggtgc cttcccagca 6360 

tgccagcccc accaatccac tgccgaggtt cagacat 6397 



<210> 181 

<211> 1149 

<212> DNA 

<213> Homo sapiens 
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<400> 181 

tttttttttt ttttttgggg ccaactgaat tcaacaagta tttattgagt gtctattatg 60 

tgctagatac tgagacacat cagagaacaa aaccaaaagc cctgcccttg tcgggcttac 120 

agtctagcac ttaccgccag ttaacctgca ggctacctgg agccccgggc aagtcaccgc 180 

acctctgtgc cctcggtcct cagctgccca atgggagaat aagcagacct ggctcaaaac 240 

atgaatcatg tgccttggtg taactgcaag atgcccaaac tgcatccccc acaacccacc 300 

acgtaagaca gcagacaggg ctggaagttg atttttaatg aataaagtac aatgaaggga 360 

gggcagaggg gctaagccta gctgtcccgg ggtgctgtgg tggtggtaaa atggcctaac 420 

acaaactgtt gctgctgctg ctgcttcttg gtgcgtcgcc ttgctggcga gggtccttgg 480 

ccttctctgt aagctgccag tgccgtctcc tttgccttct ccttggcttc cttggctgtc 540 

tcaacaagtg ttttggaagg ggccgtcgcc ttgcagctta gccaagatat attcaaaacc 600 

cttcatagtc ttggtcacgt tgcttttgaa tcgggcaaga ccaaattcct gggacagctc 660 

tggagacacc aaattaagct agaggagacc caggcttccc cggcgggatt tcagtccagc 720 

ccactgttgg tcagagttca caacagtaaa acacatcgtt tcctccacca ccatcagccg 780 

gggcgtgggt tgatgttcca gttgaaagta gtcaaggtct gattctgtgg gtccacaata 84 0 

gagtcctcca ggacgtacac cgagtgagca acattggcag gaaatagtcg ctcgggccca 900 

gcgtgggcat tcctgttggt cttggtcaga gtcgcccggg acagcagttt ctggtcaggg 960 

gtcacctccc ggtgtactat gtcttccgtc aagacatgtt tgctataggg attcgggtac 102 0 

ccgctgccag aaggcggcga acacttggtc ccaggaactc cggagcacgc tctggcccag 1080 

gaaatacttc accatcgtcc cggccgaagc gggctcagga attacgcctc tctacctagg 114 0 

gtgtcaaag 1149 



<210> 182 

<211> 2526 

<212> DNA 

<213> Homo sapiens 



<400> 182 

tttcgtcgct gaccagcagc atggaggacc ccgccgctcc tgggaccggg ggcccgcccg 60 

caaatggcaa tggcaacgtg tggcggcaaa gggaagcagg cggcgcccaa gggccgcgaa 120 

gcgttccgaa gccagcggcg ggagtcagag ggctctgtgg actgtcccac tctggagttt 18 0 

gagtatggag atgcagatgg gcatgcagcc gagttgtcag aattgtatag ttacactgag 24 0 

aacctggaat tcaccaataa caggaggtgc tttgaagaag atttcaagac tcaagtgcag 300 

ggcaaggaat ggctggagtt ggaagaagat gcccaaaagg cctatataat gggactcttg 360 

gaccggctag aggtggtcag tagggaacgg cggctgaagg cggcccgggc tgttctctac 42 0 

ctggcccaag gtacttttgg ggaatgtgat tcagaggtcg atgtgctaca ctggtccagg 48 0 

tacaactgct tcctgctgta tcagatgggg accttctcca ccttcctgga gctactccac 54 0 

atggaaattg acaacagcca ggcctgtagc agtgcccttc ggaaaccagc tgtctccata 60 0 

gctgatagca cagagctcag ggtgctgctg agtgttatgt acctaatggt ggaaaatatt 660 

cgcctggagc gagagacaga cccctgtggg tggagaacag cccgggagac cttccgcact 72 0 

gaattaagct tctccatgca taatgaggag ccttttgccc ttttactctt ctccatggtt 780 

accaagttct gcagtggcct ggctcctcac ttccccataa agaaggtcct gctcctgctc 84 0 

tggaaggtgg tcatgtttac cctcggtgga tttgagcatc tgcagactct caaagtacag 900 

aagcgggcag aattgggcct gcctccactg gctgaagaca gtatccaggt ggtgaagagc 960 

atgcgtgctg cctccccgcc ctcttacact cttgacctgg gagagtctca gctggcaccc 1020 

ccaccctcca agctgcgagg ccgccgtggc tctcgaaggc aactcctcac taagcaggac 1080 

agcctggaca tctacaatga aagggatctc ttcaagactg aggagcccgc cacagaggag 1140 

gaagaggagt ctgctggtga tggagaacga accttggatg gagagctaga cctgctagag 1200 

caggaccctc tggtgccacc tccaccctca caggcacccc tctctgctga gcgggtggct 1260 

tttcccaagg gcctgccctg ggccccaaag gtcagacaga aggacattga gcacttcttg 132 0 

gagatgagca ggaacaagtt catcggattc accctggggc aggacacaga tacattggtt 138 0 

ggattaccca ggcccatcca tgagagtgtg aagaccctaa agcagcacaa gtatatctcc 1440 

atcgcagatg tgcagatcaa gaatgaagag gagctggaga agtgccctat gtctttgggg 1500 

gaagaggtgg taccagagac gccatgtgaa atcctctacc agggaatgct gtacagcctt 1560 

ccgcagtata tgatcgctct gcttaagatt ctgctggctg cagctcccac ctctaaggct 1620 

aagacagact ctatcaatat cctggcagat gtcctacctg aggagatgcc catcactgtt 1680 

ctccagagca tgaagctggg catcgatgtg aacaggcaca aggagattat tgtaaagagt 1740 

atctctaccc tgcttctgct actcctcaaa cacttcaaac tcaaccatat ctaccagttt 1800 

gaatatgtat cgcaacattt ggtatttgcc aactgcatcc ccttgatcct gaagttcttc 1860 
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aatcaaaata tcttgtcata catcactgcc aaaaacagca tctcagtcct ggattatcct 1920 

tgctgtacca tccaggattt gccggagctt actactgaaa gtctggaagc tggagacaac 1980 

agccagttct gctggaggaa cctcttttcc tgcatcaacc tcctgaggct gctcaataaa 204 0 

ctgaccaaat ggaaacattc ccggaccatg atgctggtag tgtttaaatc ggcaccaatc 2100 

ttaaagcggg ccctcaaggt caaacaggcc atgctgcaac tttatgtcct aaagctacta 2160 

aagttacaga ccaagtacct ggggcgccaa tggaggaaaa gcaacatgaa aaccatgtca 2220 

gccatttacc agaaagtgcg tcaccgcatg aacgatgact gggcttacgg gaatgacatc 2280 

gatgccagac catgggactt ccaagcagaa gaatgtacct tgagggccaa cattgaggct 2340 

tttaacagcc gtcgctatga cagaccccag gactctgagt tttcacctgt ggataactgc 2400 

ttgcagagcg tactggggca gaggttggat ctgcctgaag atttccacta ttcatatgag 2460 

ctctggctcg agagagaggt gttttcacag cccatctgtt gggaggagct gctccagaat 2520 

cactga 2526 



<210> 183 

<211> 1692 

<212> DNA 

<213> Homo sapiens 



<400> 183 

cccacgcgtc cggaagcact tgaggccaga ggaagacatg cgccttgtca gtgatccctg 60 

catccctagg ggggggcagc acgtcattcc tgtgagaggc tggactggag agagagactg 120 

ggttggcagc caggctggag aagctgtgac cagcctggaa cgccaggccc aggcaggtgg 180 

acttgaaact tactcacaag aagcaaagag ccctgttgga aaggtttgac atctacagga 240 

aggtgcccaa ggaccttacg cagccaacgt acaccggggc cattatctcc atctgctgct 300 

gcctcttcat cctcttcctc ttcctctcgg agctcaccgg atttataacg acagaagttg 360 

tgaacgagct ctatgtcgat gacccagaca aggacagcgg tggcaagatc gacgtcagtc 420 

tgaacatcag tttacccaat ctgcactgcg agttggttgg gcttgacatt caggatgaga 480 

tgggcaggca cgaagtgggc cacatcgaca actccatgaa gatcccgctg aacaatgggg 540 

caggctgccg cttcgagggg cagttcagca tcaacaaggt ccccggcaac ttccacgtgt 600 

ccacacacag tgccacagcc cagccacaga acccagacat gacgcatgtc atccacaagc 660 

tctcctttgg ggacacgcta caggtccaga acatccacgg agctttcaat gctctcgggg 720 

gagcagacag actcacctcc aaccccctgg cctcccacga ctacatcctg aagattgtgc 780 

ccacggttta tgaggacaag agtggcaagc agcggtactc ctaccagtac acggtggcca 840 

acaaggaata cgtcgcctac agccacacgg gccgcatcat ccctgcaatc tggttccgct 900 

acgacctcag ccccatcacg gtcaagtaca cagagagacg gcagccgctg tacagattca 960 

tcaccacgat ctgtgccatc attggcggga ccttcaccgt cgccggcatc ctggactcat 1020 

gcatcttcac agcctctgag gcctggaaga agatccagct gggcaagatg cattgacgcc 1080 

acacccagcc taatggccga ggaccctggg catcgccagc cttgcctcca gtgccctgtc 1140 

tcctttggcc ctcaatctgg tcccaaatct ggctgtgtcc caaagggtgt gtgggaagtg 1200 

gggggaaagt agaggatggc tcgatgtttt gcagctacct cttttccccg tgtttctttt 1260 

tagacaaatt agcactgcct gaagatgcag ttcccctttt tctggagagc cccaagaaca 1320 

gagtcaggca gggggtgggg agtccaggga tcttggggac ccctcctagg agagctgcag 1380 

tctcttccct caggggaaca tcccagaatg catatcgatc agctctcagc caggcttcga 1440 

caatctcgca gcccccacta tgtggacaca ttaatgattt ggtttctccc ctgggcagcc 1500 

aacctgcccc agaggcacca aacctgggct ttctagcttt tgggaccagg ctgcccaaag 1560 

gtactccttt atacacccgg caccttccac ggagatgggt actttcccaa gcaagcccct 1620 

atgatttgtc actatagatg gaaccctgac ttctgcccca tcccttcctg cccaacctag 1680 

aacccaggcc tg 1692 



<210> 184 

<211> 2533 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_jE eature 
<222> (1) . . . (2533) 
<223> n ss a, t, c or g 
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<400> 184 

acagcttttt ttgttttttt tttttcttat acctaaaaca gggatatcfcg catttaattt 60 

cattcagtaa atcaattcca gttaacttac catctcttaa aaatgggaga aagcaaacat 120 

aggacgtgaa aagttaaaga tgcgtgacat actggagagt aataagacat actggagagt 180 

agtaagtgca cacgtggcaa ttaagggttg taaatttaag atgtaaacac aaggaaaaaa 240 

agtaaaaatt actgtaactt tattgcctgt atcctatgcc tttcccccag taatagccta 3 00 

naaataccta caagggaggc ccccaccccc aaaagaagaa aaagtgccca ccttctggtt 360 

taaaactgtg gcaccttgat ttctttttga aatgtcactt tataaggtgt atgtagaaag 420 

caaccagcag ccgtttacaa aatgtttgtc tgagttgatt ctgagagctc caaaacaagg 480 

gatccgcgta taggctgaag aaaaaagacg gttcaggtta accagtgcgc cgctgtagga 540 

acttttaaca caagtcttcc aggtggaatt tcctgtgtaa accctaagta agaagatgcg 600 

actcacggag gaccaaaagg taaaaatctc ttttaccggt ttacaagttt aagtgagggt 660 

gggtctgcat tctccgcaaa acgactttac aaaggtacaa ggaaatgttg cgtgtgaggt 720 

attaggcata gaatatttca ttttccttac cggaagggag cagcacgggg acaggaaacc 780 

tagtggacgc cccgccacca cctttcccga agatgcacac cccccaggga acgggcagtg 840 

caaagcctgg gcagcaagcc acccttacaa ttctttttgg ttgtttaaaa tacaagggct 900 

ttctgacatc ctcaccctag gcgtgtgagc cccactcaac ctgggcaggg ntacaaccca 960 

cgtcctcgtt tagttttccc ccttgggaat ttttttacag ggatgcctga cagatacggg 1020 

tacaaactag tatgaaatta atgatagttt gcgtttccag aaacggcccc cgtggttaaa 1080 

tataccaccc ccatttccaa agaaaggttc aaagttaagg gtgactcaga agcattggca 1140 

tcccacagcc gcaagaaaac aggcttctcg ctgaactggg gtggaatcgc ccgactctgt 1200 

ccctcaacgg tggtgggggc cttggcaggt ggggggcgtc acatccacag tgagtgagga 1260 

ggcagcgtct gcctttgcag tagttgtggc agggtcagag ggtgcgttgc tggtgggcgg 1320 

gagggaagac agcagctctc ctgagcttgc tgcagaagag ctctctggaa ccaaggggaa 1380 

tgacgggagg aactcggatg gccaggggga agaggtgcaa tgcccaggta agtttcgggg 1440 

aggcatggaa ggaagaggtg gcaaaacccc gggtttaaca aattcctggt aaagcccaac 1500 

ccccggccat gaatgtgtgg tgcctgggaa ggttgaaggt tgaaggttgg ggaaggttgg 1560 

gcagtccctg cctggtaaaa ggcatcagac ccggtaaatg taagtagctg gattccattg 162 0 

gtggcacaga agtgagggac tggtttactt gtggtggcaa taactggtac tgttggtaca 1680 

cctgtactga gaacactact gacagctggt gggagttgag ctaatagaaa gtccagtcag 1740 

actctgttca attcctgtag ttcctggtgg tgacaaagac gggggattaa ctgaggacag 1800 

ctggacctct gtaaacccat ctttaagagg tgtaataggt gtaccagcca tttgtcctgg 1860 

aagagaaatt ttctttcctt cctcaaatgg gcgtgbaggt attcgatgca aataaccata 192 0 

tccaatgcca catcctaggc tgccttctcc accccatgca gaatttggtg taataatcac 1980 

ttctcgaccg ttactcacgt gtctgtgttg tacacataca gtttcaatgg ttttgcttca 2040 

tgtgtttcga taaggctgaa tagatcttca gactcattca tgactgtatc tgctccaatt 2100 

atataatcac tgtgtggtct aagacctgcc agtgctgcag gagaatttga ttccacctcc 2160 

agcacatgcc aaacattttc atttgcccca tcaaagctgc agaaacgaat gctcactccc 2220 

aataagccct ggccgcccca caggttactt ggtgtgactg aggtctctcg cagttccaat 2280 

gttttgctgc tatagataag catctttaca ggcttttcaa cgtttgcttt cagcagatcc 2340 

ttaagagtgt cattgtcttt atttaatctt gaaccattaa tagaaacaat aaaatcaaag 2400 

aaaggctcca aaccagctct gtgtcctggg gaattttctt gtacctcctt caggtagcca 2460 

ggtcgtattc atgcttattc cgttgtcgta ttccgttgct gcttttgccc cttacgtgta 2520 

ctccctctga aac 2533 



<210> 185 
<211> 906 
<212> DNA 
<213> Homo sapiens 



<400> 185 

ccgggggccc ccattttaaa ttggtttccc ccacaaggga aaggcccccg ggcaaatgtg 60 

acataaaatc tccttttggg aggggggggg aaaaagaaaa aacccccccg gacccggggg 120 

ccttttcccg ggctttccca accgggcctt ttatggcttt tcccaaagag gcctggttgc 180 

agtcatttgc cctccacccc cattaaggaa atgggtttgc caaaaattca ccatagggtg 240 

ggatgggaat cattttcggg tgtcttctta gaagttgact tcaaaatcta caaaaagaaa 300 

atgaatgagt ttttctccgt agacgataat aatgaagaag aagaggatgt tgaaatgaaa 360 

gaagattcag atgagaacgg tccagaggag aagcaaagtg tggaagaaat ggaagagcag 42 0 

agccaagatg cagatggtgt caacactgtc actgtgcccg gccctgcttc agaagaggca 480 

gttgaagact gtaaagatga agattttgca aaggatgaaa atattacaaa aggcggtgaa 540 
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gtgacagatc attctgtgcg tgaccaagat catcccgatg gacaagagaa tgattcaacg 600 

aagaatgaaa taaaaattga aacagaatcg cagagctcat atatggaaac agaagaactt 660 

tcatcaaacc aagaagatgc cgtgattgtg gagcaaccag aagtgattcc attaacagag 720 

gaccaagaag aaaaagaagg tgaaaaagct ccaggcgagg acacacctag gatgcctggg 780 

aaaagtgaag gctccagtga cctagaaaat actccaggtc ctgatgttga aatgaatagt 840 

caggttgaca aggtaaatga cccaacagag agtcaaccct cgtgccaagc atgacgtagt 900 

cggggt " 906 



<210> 186 

<211> 8B3 

<212> DNA 

<213> Homo sapiens 



<400> 186 

atgggctccc ccaacgtgtg gcctccagtc tcgggcctgt gcattgaggt gctcgtggaa 60 

gacagtatct gccggctctc ttcccagcag cctcctccca tctcccttcc ccgaaggcag 120 

caacatcccc agcttctaca cagtcgcacc ataggcaagc gtccttggac ctcgccactg 180 

tccccagagc aactcttgga gaacaacagt gcagataaag cctctggttc tgtgtccacc 240 

acgttgccgg ttgcttataa aaagggagtc aaacatcttt ctgctcttct tgatccagag 300 

gtcctgtcca ttttcgtgcc tccttttatc agtaaagagg acagtcaaac ggccggtgcc 360 

aactgcggca ctctcggtaa aacccggatg cgctccttga gaaagaagag agagaagccc 420 

agaccagagc agtgggaggg cctcccgggg ccccccagag cgccagagcc tgaggatgtc 480 

gccgtcccgg gcggcgtgga cctcctcacc ctgccgcagc tgtgcttccc agggggtgtg 540 

tgcgtggcca ctgaacctaa ggaggattgc gtccacttcc tggtgctgac cgatgtctgc 600 

gggaatagga cctatggcgt ggtggcccag tactaccggc ccctgcatga tgagtactgt 660 

ttctacaatg gcaaaaccgc accgggagtg tcctggctgc ttcgtgccct tcgcggtgtg 720 

cgtggtctcc aggtttccct attacaactc cctcaaggac tgcctttcct ggcactaccg 780 

cagacctggg gccactctgc tgtcacccag cagcttggtg actttgcttt tggtgaaagg 840 

tcctggagct gctgctgcag acgctgggga gattccagtg tga 883 



<210> 187 

<211> 1464 

<212> DNA 

<213> Homo sapiens 



<400> 187 

tgtagactcc tgcatagcct tccaaagttc ttcatttggt gaatattatg agaaatgaca 60 

ccgaggtgac tggcacatgg tcgaggcagc tgcttaggca gcttaagcat gccctgtgaa 120 

acatccctgt gggggactcc gaccagccca ggcaacgcgg atcctgagaa cgctcccggg 180 

taggcatctg cccctgtggg atgcctcacc agagcagtgt gtggcaggcc cccgtggagg 240 

accaacacag tggttgaacc ccgggaaggg acaaggatct tggagtttgg acatctaaaa 300 

cttgcacatg taccacccct agaatttctg gtaaaccagc accagcctga agatcacgtt 360 

ctcatcaaac ggtggaaaga agaaaaactc gagccagcct gggaaggacc ctaccctgtg 420 

ctgctaacca ccaagactgc tgttcgtaca gcaaaaaaaa aaaaaaaaaa aaaaaaagat 480 

ggactcatca cacccaagtc aagaaagtgc caccccctcc agagtcgtgg gccatagtcc 540 

caggggaaaa ccctaccaaa ctaaagctaa gaaaaatgta actcttttca tctattctat 600 

tactcttctt tctttcctcg tgctattgct gaccatctag ttattaacat aaccaagtca 660 

atttcacctc aaactattgc atttaatgct tgccttgtta taccctgtgg ggacttgcca 720 

agtcaaagac agctctctac ttcagaaaag tacttctgtc cctcctgact ctcctcagac 780 

tggacattag taaactagga ccatttaatc cagggagatt tcgataaaga ccccagtgcc 840 

aaccaggagt cttgcccccc gatgtagaac tttcatgcca tagttggtcc aacgttctgt 900 

ggaccactgc agccaaaacg ccaattttct gctcctactg agaggtgaag ccagctggac 960 

ttcctgggtt gagtggggac ttggagaact tttctgtctt acaagaggat tgtaaaatgc 1020 

accaatcagc actctgtagc taggactgta aaacacacca atcagtgctc tgtaaaacgc 1080 

accaatcagc agaatcctcc ccgtctgtat gcagccacca tgcagcccac ctaggctgtg 1140 

ggctctgttg atgccgcctt ggagtgtgcg gcttgctctg ggtgggcctt ccaggggagg 1200 

agggaagttg gaactggcaa tcgagccgta gagagagagc aaatgggaag cagcttgtgt 1260 

tccccaggcc tggactagca tgcccttcct ggcatccatt tttcctgctg ggccccaagg 1320 
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gacctggccc tgtgcgtcag ttctgtggct gctagacctt gggctctgtg gtcgggtctt 138 0 
tggacatgac atctgcacgg gctgggggct gaggctgctg gccagggcga gccagggcag 1440 
gccaccaggc tgcagaaccc acat 1464 



<210> 188 
<211> 4095 
<212> DNA 

<213> Homo sapiens 



<400> 188 

atgaggcgag gtggatggag gaagcgagct gaaaatgatg gctgggaaac atggggtggg 60 

tatatggctg ccaaggtcca gaaattggag gaacagtttc gatcagatgc tgctatgcag 120 

aaggatggga cttcatctac aatttttagt ggagttgcca tctatgttaa tggatacaca 180 

gatccttccg ctgaggaatt gagaaaacta atgatgttgc atggaggtca ataccatgta 240 

tattattcca gatctaaaac aacacatatt attgccacaa atcttcccaa tgccaaaatt 300 

aaagaattaa agggggaaaa agtaattcga ccagaatgga ttgtggaaag catcaaagct 360 

ggacgactcc tctcctacat tccatatcag ctgtacacca agcagtccag tgtgcagaaa 420 

ggtctcagct ttaatcctgt atgcagacct gaggatcctc tgccaggtcc aagcaatata 480 

gccaaacagc tcaacaacag . ggtaaatcac atcgttaaga agattgaaac ggaaaatgaa 540 

gtcaaagtca atggcatgaa cagttggaat gaagaagatg aaaataatga ttttagtttt 600 

gtggatctgg agcagacctc tccgggaagg aaacagaatg gaattccgca tcccagaggg 660 

agcactgcca tttttaatgg acacactcct agctctaatg gtgccttaaa gacacaggat 720 

tgcttggtgc ccatggtcaa cagtgttgcc agcaggcttt ctccagcctt ttcccaggag 780 

gaggataagg ctgagaagag cagcactgat ttcagagact gcactctgca gcagttgcag 840 

caaagcacca gaaacacaga tgctttgcgg aatccacaca gaactaattc tttctcatta 900 

tcacctttgc acagtaacac taaaatcaat ggtgctcacc actccactgt tcaggggcct 960 

tcaagcacaa aaagcacttc ttcagtatct acgtttagca aggcagcacc ttcagtgcca 102 0 

tccaaacctt cagactgcaa ttttatttca aacttctatt ctcattcaag actgcatcac 1080 

atatcaatgt ggaagtgtga attgactgag tttgtcaata ccctacaaag acaaagtaat 1140 

ggtatctttc caggaaggga aaagttaaaa aaaatgaaaa caggcaggtc tgcacttgtt 1200 

gtaactgaca caggagatat gtcagtattg aattctccca gacatcagag ctgtataatg 1260 

catgttgata tggattgctt ctttgtatca gtgggtatac gaaatagacc agatctcaaa 132 0 

ggaaaaccag tggctgttac aagtaacaga ggcacaggaa gggcaccttt acgtcctggc 138 0 

gctaaccccc agctggagtg gcagtattac cagaataaaa tcctgaaagg caaagcagca 1440 

gatataccag attcatcatt gtgggagaat ccagattctg cgcaagcaaa tggaattgat 1500 

tctgttttgt caagggctga aattgcatct tgtagttatg aggccaggca acttggcatt 1560 

aagaacggaa tgttttttgg gcatgctaaa caactatgtc ctaatcttca agctgttcca 1620 

tacgattttc atgcatataa ggaagtcgca caaacattgt atgaaacatt ggcaagccta 1680 

cactcataac attgaagctg tcagttgtga tgaagcgctg gtagacatta ccgaaatcct 1740 

tgcagagacc aaacttactc ctgatgaatt tgcaaatgct gttcgtatgg aaatcaaaga 1800 

ccagacgaaa tgtgctgcct ctgttggaat tggttctaat attctcctgg ctagaatggc 1860 

aactagaaaa gcaaaaccag atgggcagta ccacctaaaa ccagaagaag tagatgattt 192 0 

tatcagaggc cagctagtga ccaatctacc aggagttgga cattcaatgg aatctaagtt 1980 

ggcatctttg ggaattaaaa cttgtggaga cttgcagtat atgaccatgg caaaactcca 2040 

aaaagaattt ggtcccaaaa caggtcagat gctttatagg ttctgccgtg gcttggatga 2100 

tagaccagtt cgaactgaaa aggaaagaaa atctgtttca gctgagatca actatgggaa 2160 

taaggtttac tcagccaaaa gaggcagaag cttttcttct gagtctttca gaagaaattc 2220 

aaagaagact agaagccact ggcatgaagg gtaaacgtct aactctcaaa atcatggtac 2280 

gaaagcctgg ggctcctgta gaaactgcaa aatttggagg ccatggaatt tgtgataaca 2340 

ttgccaggac tgtaactctt gaccaggcaa cagataatgc aaaaataatt ggaaaggcga 2400 

tgctaaacat gtttcataca atgaaactaa atatatcaga tatgagaggg gttgggattc 2460 

acgtgaatca gttggttcca actaatctga acccttccac atgtcccagt cgcccatcag 2520 

ttcagtcaag ccactttcct agtgggtcat actctgtccg tgatgtcttc caagttcaga 2580 

aagcttaaga aatccaccga agaggagcac aaagaagtat ttcgggctgc tgtggatctg 2640 

gaaatatcat ctgcttctag aacttgcact ttcttgccac cttttcctgc acatctgccg 2700 

accagtcctg atactaacaa ggctgagtct tcagggaaat ggaatggtct acatactcct 2760 

gtcagtgtgc agttcgagac ttaacctgag tatagaggtc ccgtcacctt cccagctgga 282 0 

tcagtctgtt ttagaagcac ttccacctga tctccgggaa caagtagagc aagtctgtgc 2880 

tgtccagcaa gcagagtcac atggcgacaa aaaagaaaga accagtaaat ggctgtaata 2940 

caggaatttt gccacaacca gttgggacga gtcgttgttg caaataccga gaacctcaag 3000 

aatcgaacag tgacgcagga ataaatttaa tagcccttcc agcattttcg acaggtggac 3060 
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cctgaggtat ttgctgccct ttcttgctga acttcagagg gagctgaaag cagcgtatga 3120 

tcaaagacaa aggcagggcg agaacagcac tcaccagcag tcgagccagc gcatctgtgc 3180 

caaagaatcc tttaatatca tctaaaggca gcgagtgaaa gaaaagaaaa gaaacaagaa 3240 

gaaaaaaacc attggttcac caaaaaggat tcagagtcct ttgaataaca agctgcttaa 3300 

cagtcctgca aaaactctgc caggggcctg tggcagtccc cagaagttaa ttgatgggtt 3360 

tctaaaacat gaaggacctc ctgcagagaa acccctggga agaactcttc tggcttctta 3420 

ctttcaggtg tgccagggcc tttctagttt gcagtctgac cccagctggg ctgtgtgaga 3480 

cctccaccac cccaatctag gctggagctg ttgaattcaa tgatgtgaag accttgctca 3540 

gaggaatggg taactacaat ttcagatcca atggaagaag acattctcca acgttgtgaa 3600 

atactgtact gatctaatag aagacaaaga tttggaaaaa ctggatctag ttataaaata 3660 

catgaaaagg ctgatgcagc aatcggtgga atcggtttgg aatatggcat ttgactttat 3720 

tcttgacaat gtccaggtgg ttttacaaca aacttatgga agccacatta aaagttacat 378 0 

aaatattacc cagagagcct gatgcctctc tgatagctgt gcccataagt gcttgtgagg 3840 

tatttgcaaa gtgccatgat agtaatgcct cggagttttt ataattttaa atttctttta 3900 

aagcaagtgt tttgtacatt tcttttcaaa aagtgccaaa tttgtcagta ttgcatgtaa 3960 

ataattgtgt taattatttt actgtagcat agattctatt tacaaaatgt ttgtttataa 4020 

agttttatgg atttttacag tgaagtgttt acagttgttt aataaagaac tgtatgtata 4080 

ttttgtacag gctcc 4095 



<210> 189 

<211> 1536 

<212> DNA 

<213> Homo sapiens 



<400> 189 

atggctgaga cggaggagcg gagcctggac aacttctttg ccaagaggga caagaagaag 60 

aagaaggagc ggagcaaccg ggcggcgagt gccgcgggcg cagcgggcag cgccggcgga 120 

agcagtggag ccgcgggtgc ggcgggcggc ggggcgggcg cggggacccg gccgggtgac 180 

ggcgggaccg ccagcgcggg ggctgcgggc ccaggggccg ccaccaaggc tgtgacgaag 240 

gacgaagatg aatggaaaga attggagcaa aaagaggttg attacagcgg cctcagggtt .300 

caggcaatgc aaataagcag tgaaaaggaa gaagacgata atgaaaagag acaagatcca 360 

ggtgataact gggaagaagg tggaggtggt ggtggaggta tggaaaaatc ttcaggtccc 42 0 

tggaataaaa cagctccagt acaagcacct cctgctccag taattgttac agaaacccca 480 

gaaccagcga tgactagtgg tgtgtatagg cctcctgggg ccaggttaac cacaacaagg 540 

aaaacaccac caaggaccac cagaaatcta ccagtgatac cacagttccc atccccttgc 600 

agtcaacttg ccaaagcatg tagaaagccg gaaggataaa gaaatggaga agagctttga 660 

agtagtaaga cacaaaaata gaggtaggga tgaggtttca aaaaaccagg cccttaaact 720 

tcagctagac aaccaatatg ctgtgcttgg aaaatcagaa aagcagccac tcacaataca 780 

attaaggaat gggctttgct aacccttctg aggtaactag actgcagcta aaccaccacc 840 

aacagccatt catcatctga tctctgctgg atctacagac accgatgcag gaccactcgg 900 

atttccatga ccggccctat tgcacttatg gaagttaaag tgtcacgacc gcccctatgc 960 

catattggat ttaggggaat tttccattgt tacataaatg tgtgaaccta gtttccaaca 1020 

gtgttccttt catatttact ctgcaaattc caaaaaacca aaacctgcag cccagtgggt 1080 

ccatttccaa aatcttttta tggttcagat actgaggcct ttcataaggg ttgactacct 1140 

cagatttgct gcactcattg tggacttcat gtggatcaca acttctggat aagaagatta 1200 

caactattaa gtgtcgatgt gaaccttgca accagctcta ctggattctt atcagaaatc 1260 

ctgcataaaa agtcagccat ctgggttctg atctgctgta aaagatgaag atttaagtga 1320 

ccttaattaa cctgtcctgt gccctaccct taaggaatac tcagtatctg aacataaaaa 1380 

gattttgaaa tgaccactgc ctgcaggttt tggttttttg tatttgcaga gtaaatatga 1440 

aagaccactg ttgaaactag ttcacacatt tatgtaacaa tgaaaattcc cctaaatcca 1500 

atatgcatag agcagtcgtg acactttaac ttccat 1536 



<210> 190 

<211> 3498 

<212> DNA 

<213> Homo sapiens 



<400> 190 
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caaggcccaa gaagcacttg ggtccccttg ttattgaacc tagagaccag gaatcacaag 60 

acaactacct ttttgagcac caaaggcagt gaggtatttc ctgatgaccc ttcgatattc 120 

ttagcacttt catatctcaa gtcttctagc aaaatgtcaa aaacttattc tccctttttt 180 

ctccagcctc atgtcttacc caagcagtta atctgcagct cctttgtctg ggcaccatcc 240 

aggatggaaa ggaagagggg aaaaagcttc tcacggaagt gtcctgaaag catctcagct 3 00 

gccactggtg aggagtatag cttcccataa aggtaacacc caaatggcag tatgataaaa 360 

gcaggtaaaa gggtagtggc agctcctccc acaggtcact caccatgctc cattacaagc 420 

tcgtctgcca gagcagggat gaaagggttg acccagctct gacatctgag gattctgaga 480 

gactcaagaa cccatagcta ttttctgacc tctttaagag gcgagaacaa cccctggacc 540 

gaccaaagcc cgtgtgccgc tgcatcccgc gcacagcacc tacatcccgc tgccgtcgct 600 

gctgccacca tgcccaagac aaaggctgaa ggtgatgcta aaggagataa agccaaggtg 660 

aaggacgaac cacaagtaac cagagctgca attcagacta atactttcat cttcaagtgc 720 

tagatagagc cccaaaagca gatatacata ctatacattc aaaactcatg tcaaatttcc 780 

ctcttaatct taccaaaaag cacacttatg aagtggatgc agactctctg atcctgagac 840 

accagttgcg gcctggccgc gtttgctgag ggaaatggcc aaacccaggt ■ cttctgaaga 900 

aaaggttccg cccgcaccga cacgccgggg tccggcagca gctccagcgc gcaggccagg 960 

cacaggcggg gggtcacggg cagcagccag cgcgggtcgt gccggtaatg ggccctctcg 102 0 

aatagaagcg ccgcctcttc ctctctcctg ggcccgggag tgcccgccgt cgccgcctgc 1080 

ctcacagcca tctcctcgct tgccctcagc tgaggcgggt tcccaggtgc ctcctggccc 1140 

tggggactct ctgtcttagg aagggtggcc ccagtacagc agtgaccttg cagctgaaga 12 00 

tccctctgtg aagcagtgtg atctccgatg ctgtcctccc ctgcacgagg atcacccact 1260 

atccccactc tatgaatccc tggcaggggt tgtgcccctg gctggccaga tgctctccct 1320 

ccccacccac cagcaggaag acttcacccc agtgggggac aggattcatc tcaggacagg 1380 

gggcagagat gaaaccattc acagccactg cctgttcctg ttcccgggtc ccacatccat 1440 

agccagtgaa ggctggaggc catggtaggc tgtgcttcct gcctgtgggc ctctcactcc 1500 

cagaacagtc gacttggaga ctgtccctca cagcatcctc tcctctgacc agatccttgt 1560 

ttcctcccct tggtgctcga ggctctccaa catcctcatc tgtgcacttt ctctggcttc 1620 

ctatatcctg gagttctgca acatcctcaa ctccaccgtc attcttattt ccttgccctt .1680 

gtacctggat ctcttcagca atttcaactc ttaactgtcc ctggtcatct cctgcaggag 1740 

cctgggtctc tgtaccatca tcagctgtgg tccctctcag ctttttaaca gtctgtttct 1800 

tcgctctaag cctccagttt tttcccctaa gcctccagat cattctaccg tcctcgtttc 1860 

caacctctct ctggtttctc ttctctgatg cctgattttc tgcaacatcc tttccaatca 1920 

gctcccagtt cctcctccct ggtgaatggc tccctccaat ttcactatct aactggtcct 198 0 

cgttctctct cttaggtctc tgagtcttta caccgtcctc acctccaggt tctctctggt 2040 

ttctctccac tggagcctga atttctgttc cattctcact tcccatctgc ttctggctct 2100 

tccactctga taaaaggatc tttctaccaa gatcacttct aaactgactc tgggtttctt 216 0 

tcccaggtgc ccaggtcttt acaccgtcct cacctccaac ttctctcagg tttctcttcc 2220 

cttgtatctg aatttctgcc ccattctcac ctccaatctg ctgatctttc ttccattctg 22 80 

ctgaatgcat ttctccatca atttgacttg ttaatttgcc ccagttccct gactcagata 2340 

tctgggtatg tgcaccatct tcgcctctaa tatgttcctg attcttcccc tttggtatgt 24 00 

gaatctttcc actaaattca tttcctaact gaccctgatt tttcctccca agttcctggg 2460 

tgtctttatc atcctcacct ttaacctctc tcctccctag tgactgaatg tctgtaccat 2520 

cctcttctct aagtggctca ttgctctgag tctccacatc agttatcacc ttttaattgt 2580 

tcctggatct cagcctcggg tgccaggata tttgtgccat cctcacatga ttctcccaac 2640 

atagtatctc aaattctcta taaatttcac ttgctacttg tctccggttt ccccaccata 2700 

gtgcctggat caccacccaa tccttctctt atctccctga gcttcttctt ggtagctatc 2760 

tgggtttcct tagcatcttc atctcttctc aggcccttat ttccccaccc cacttgaatt 2820 

tcatgactta attgtttctg gttctctccc ccagatgctt gggtcttctt agctttctta 2880 

cttccacctt gatcatgctt ctcccacctt ggtgtctgga tctctgtaac attcccattt 2940 

ccaacacatt cctgttttcc ccaatccaag gcctgggtct ctacactatt ctcttttttt 3000 

acttctccca cgttctccct cccacatgat tgggtttgtg cagcatcctg acctccaacc 3060 

tgatccttgt ttctcccctc ttccatctga atttccacat ccgcatcacc tttcatctga 3120 

tcttgtttct cccactgtgg tgtctgggtc tcctcaccat tctcatctcc agtctggcct 3180 

tgcttcccca atccttgagc ctggatctct tccccatcct cctctccagc ctctttcttg 3240 

tccttcttct caaatgtctg ggtctccaca gcatcctcac ttccattctt atcctggttc 3300 

tcccactcag gtgtttgggt ctctgcatca attttacatc ttaactgttc ttggttctct 3360 

cccatatgtg cctgggtttc agggagattc tcacctccag cctccctctg gtttctattc 3420 

cctagttcct gaatgttggt tctactctca actcccatcc ggtcctgggt gtcccaccct 3480 

ggtgtcaaga tctcgcca 3498 



<210> 191 
<211> 5410 
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<212> DNA 

<213> Homo sapiens 



<400> 191 

attcatcggt agaacaagca tcggttgaag tacctgatgg accaacactc catgacccag 60 

acctctatat tgagattgtg aaaaatacga agtctgtccc agaatattca gaggtggctt 120 

atcccgatta ttttggtcac attccgcctc cattcaaaga gcctatttta gaaaggcctt 180 

atggtgtaca aaggacaaaa attgctcaag atattgaaag gctaatacat cagagcgata 24 0 

tcatagatcg tgtggtatat gacttggata acccaaatta caccattcca gaagagggag 300 

atattttgaa atttaactcc aaattcgagt ctgggaatct gcgcagagta attcagatta 360 

gaaaaaatga atatgatctt attctgaact cagacataaa cagcaatcat tatcatcagt 420 

ggttttactt tggaagtcag tggaatgcga ccaggtgttg cttacaggtt taacatcatt 480 

aactagcgaa cagttgcaac aggctgttta attatggtat gcaaccactc atgtattcgg 54 0 

ttcaggaagc attaaatgcc agaccatggt ggattcgtat ggggactgac attcgttact 600 

atataaatca tttctcaaga agttcagttg ctgcaggtgg ggcaaaaggg gaaaatccta .660 

ctatacaatt acatttactg ttcaattttc cacataaagg atggatgttt gctactttgc 720 

tttattcact atccatatac gtatttcaac tttacagatg catcttcaaa aattggaatc 780 

agcacacaat cctcagcaaa tctattttcg gaaagatgtg ttatgtgaaa ccctgtctgg 84 0 

aaacagctgc cccttggtga ctataacagc aatgccagag tctaattatt atgaacatat 900 

ctgccatttc agaaatcgcc cttacgtttt aatgtatgct cgggtacatc ctgggggaaa 960 

cttaatgcaa gttggggtta tgaaagggaa cgttgggaat atctccatga ggcaattaac 1020 

cccactggct tcaggagctt acggaggaat ctttattatt ttttaaaatt gtccctatgt 1080 

taaatccaga tggtgtcatc aatggaaatc atcgctgttc tttaagtgga gaggatfctga 1140 

ataggcagtg gcaaagtcca agtccggatt tacatcctac aatttaccat gctaaggggc 1200 

.tgttgcaata cttggctgca gtgaagcgtt tacccttggt ttattgtgat tatcatggcc 1260 

attcccgaaa gaagaatgta tttatgtatg gttgcagcat caaagagaca gtgtggcata 1320 

ccaatgataa tgcaacttca tgtgatgttg tggaggatac gggatacagg acattgccta 1380 

agatactgag ccatatcgcc ccagcatttt gcatgagcag ctgtagcttc gtagtggaaa 144 0 

aatctaaaga atccacagca cgtgttgtag tttgcacggg aaataggagt acaaagaagt 1500 

tataccatgg agagtacttt atgtggctgt gatcagggaa aatacaaggg tttacagatt 1560 

ggtacccgag aactggaaga gatgggagca aaattttgtg ttggtctttt acagtttgaa 1620 

aagactgacc tctccattgg agtataaatc ctgccctccc cagccctgcc ttgacctttg 1680 

aaaatgattt aaattgaatc caagcctgca aagtaactag cccctaccca cttatgtcct 174 0 

tggatgaaga tgaaccctcg attcccttga agaagttgat tacagtgcag aaagtaatga 1800 

tgagttagat attgagttgg ctgaaaatgt aggagattat gaaccttctg ctcaagaaga 1860 

agtactttct gactctgaat tatcaagaac atacctacct tgagcccgct cgccatctct 1920 

ttgttaactg caaagaataa atgaaatatc ttggtgtttt attctcccag gaagcttgag 1980 

agaaatgagt ttatacagag ctgactcaaa aagacaaaaa gtaacttggg ccagtttggt 204 0 

ttcaagataa taaatgtgtt attaattaat gattaaaatt ggcgcctggt tttattttcg 2100 

gatattccaa tgcactttat ggtagccatg gaatggatca aaatattggg atttaccctt 2160 

taaaaaaaaa aaccctgagt atcattggca tgaaatttta tcctccccta tggttatatc 2220 

cctgacatca agtggataat tttgaagtgt gttccagaat ataaaattga aattttagag 2280 

ttgttgaaaa tccctgactt gttgaaaact aatatatatg tacatggatt tctatagatg 2340 

tgtttgttta gaagtgggta gatattgcag ataagactgt tcttcagaat catgttaact 2400 

attgggttgt gactgaagta gtcccagggt ttgccttgaa acccattaca ttctacattt 2460 

accaaattaa acaaataaaa actgtattaa atgttgcatt cattttgtca tcttctttaa 252 0 

ccagctcaga ttatttgatg tatagaactt tgtgagaatg tgataaaaac ccagaattgg 2580 

acacagtgat aaaaagttgt ttttaagaaa gtgctgggat tacaggcgtg cgccaccgtg 2640 

cctggctaaa tcctttcttg taatgaaacc tacctgggtt ttagccctct tgctgcaaga 2700 

atgactttat ttttaaataa aataaagcaa atccattgac tttggataag tgttttaaga 2760 

actaagtttc tgatataata aaacaacctc caaaagatac ctcattgcag attttcgcca 2820 

ggattcaggt gcttaggttt tggcacatat cttcctcact ttttcctata gtgtttttaa 2880 

cctgttataa ttggaatgat agagatgtat ttcagaattt tgcatggtcc aaatcaatga 2940 

tttttctttt taatatgtaa agtatttcat taacagtgta gattctttct gatgtttggg 3000 

gagccatgtt taaaaatgta gtatggagca attgaaaaag gggtcttttt ccctagccta 3060 

attcttacta atctcagaaa acaaagatca aatagactgt gaagttgaac tagtcctctt 3120 

agtgtagtaa ccaaatttag gaaagatcag gataatttag gaaggaaagt aatcatttat 3180 

ttaattattt ataattataa acaattattt tatgttttgt atattttatg tataaaacaa 3240 

ttacatgttt tattatagtt atacttcttc agagagtaga ttatggagcc accatgtcca 3300 

tacagttagt tgttccttgg tattcttgga ttgattccag gaccccccac agttacccaa 3360 

atccacagat gctgaagtct ctgatataaa atggagtagt atttgcattt aacctatgga 3420 

catcctcctg tatgctttaa atcatctcta gattacttat aatacctaat acaatataaa 3480 

tataaatgct ctgtaaatag ttttattgta tagtgtgtgt gtgtgtgttt taagagacga 3540 
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ggactcacaa tgggacactt tgaccaggct ggfcctcgaac tcctgggcct acacaatcct 3600 

cccacctcgg cgtcgcaagt agctgggact acaggtactc accactatgc ctggcagttt 3660 

tttaaatttg cgtttttaaa aattgtttat tatttattga afcattttcta cctgagattg 3 720 

acttaatctg tggatatgga acctatagat atggagggct ggctgtatat gaatttgttt 3780 

ttggctttta caaaattaga aattaaaagt aactacagtt actataatgt gcagaatgac 3840 

tcaaatattt ccatttatta tgttagtgta aaaactccta actacgtatt ataatatttt 3900 

atattaatga agtactttag catacattat gagcctgaat aaagcttttt attaaatatt 3960 

cacaagattg aaggagaaat tctttatagt taggaaatga ttttttcata aacaataaat 4020 

agctttgaaa atataaaaaa aaatttaggg ctgggtgcag aggctcatgc ctgtagtccc 4080 

agcactctga gaggccgagg cgggtggatc acttgaggtg aggagttcaa gatcagcatg 4140 

gccaacatgg tgaaactcca cctccacgaa aaatacaaaa attagccagg catggtggca 4200 

tgcgcctgta atcctagcta cttgggtggc tgaggcatga gactcacttg aacctgggag 4260 

gcagaggttg cagcagtgag ccgagatcaa gccgctgcac tccagctggg gcgacaaagc 4320 

aagactgtct caaaaaataa aaacaaaaaa attaaaataa aatcctaact ttcctagtta 43 80 

gaaatataat taacttttat tggctattag tagctaaaat tgcccttatt tttgcttctg 4440 

tggcctcatt ttttttctcg gttattcagt atttaaatta tggcttagat gtcttcaatg 4500 

ctatagtcct gaaaattaat gatctacgtt gtgcagcagt gatgggatct gataaactga 4560 

gtctgatgat ggagtttaga tttatttcct ttaccttgca tttggaactt tgaatctctt 4620 

aaatgtggtt ttggtctagt ccactggttc tcacgttgta atgtgcatca gaatcttctg 4680 

aagacacaga ttgctgatta tgtagacctg gggtggaggt ccaagaatgt gcattccttg 4740 

caaattccca ggtggtgatg ctgcagagcc acactttgag aatcactagt ataatccagt 4800 

atgctcttcc agcatcacac cttccttcct gagaatcaca ggagttgtga actgcagatt 4860 

agcattgggg agaatttaga tcaaactaat ttgtagaatc aaggaggtca agtaaggtca 4920 

caggggcact tgggttgagc cagggtttta gcccaggtct tctgacaact gcctcatgtc 4980 

cttaccacaa aggagctgct atcctttgcc tttccccaaa gagtgaagac tgcttaaagc 5040 

tcaaggatct ttcttgaatt tgtgaaattt gttcaggcaa ggtgaaaagc aaaaacctat 5100 

ggttcacatt gactttttgt attgatcatt gtcttttgaa gacaggaagt atgatcagtc 5160 

tctgccactt gtgctagttt ttgtgtggtg tttagaaaca tgggcatttg tctggatcct 522 0 

aattacaaat aagtaaccta gaattctctt cgagatagtg gactaaccac cgatgaatct 5280 

attctaaaat ttcaaatatc caaattaaaa tggactgtat taggcattaa ggtactggaa 5340 

atgggataat acattaaatg tattaatgga attgtttttg tgcatgatac agaaataaat 5400 

gatagtaacg 5410 



<210> 192 

<211> 3704 

<212> DNA 

<213> Homo sapiens 



<400> 192 

atggttgcca ttatcactat catcatcatt atcaaggtag aaaccaaaga catagtggga 60 

tggagacaca ctgagctgag actcactttt ctcttcctga atttgaacca ccgtttccat 120 

cgtctcgtag tccgacgcct ggggcgatgg atccgtttac ggagtcctgg cgcagcaaga 180 

gtgaggcgca ggcctgcgga acgggtcctg ctggaagcag ctggaatgcc ctgcagggcg 240 

gggtccgggg ccggtgactc agtgcggctg ccgccgggaa aggcgaaact gctggagcga 300 

acccgtgcca ggcgagagaa tcttcagaga aaaatggctg agaggcccac agcagcfccca 360 

aggtctatga ctcatgctaa gcgagctaga cagccacttt cagaagcaag taaccagcag 420 

cccctctctg gtggtgaaga gaaatcttgt acaaaaccat cgccatcaaa aaaacgctgt 480 

tctgacaaca ctgaagtaga agtttctaac ttggaaaata aacaaccagt tgagtcgaca 540 

tctgcaaaat cttgttctcc aagtcctgtg tctcctcagg tgcagccaca agcagcagat 600 

accatcagtg attctgttgc tgtcccggca tcactgctgg gcatgaggag agggctgaac 660 

tcaagattgg aagcaactgc agcctcctca gttaaaacac gtatgcaaaa acttgcagag 720 

caacggcgcc gttgggataa tgatgatatg acagatgaca ttcctgaaag ctcactcttc 780 

tcaccaatgc catcagagga aaaggctgct tcccctccca aacctctgct ttcaaatgcc 840 

tcggcaactc cagttggcag aaggggccgt ctggccaatc ttgctgcaac tatttgctcc 900 

tgggaagatg atgtaaatca ctcatttgca aaacaaaaca gtgtacaaga acagcctggt 960 

accgcttgtt tatccaaatt ttcctctgca agtggagcat ctgctaggat caatagcagc 1020 

agtgttaagc aggaagctac attctgttcc caaagggatg gcgatgcctc tttgaataaa 1080 

agccctatcc tcaagtgctg aagatgcgtc tttggttaat gcctcaattt ccagctctgt 1140 

gaaagctact tctccagtga aatctactac atctatcact gatgctaaaa gttgtgaggg 1200 

acaaaatcct gagctacttc caaaaactcc tattagtcct ctgaaaacgg gggtatcgaa 1260 

accaattgtg aagtcaactt tatcccagac agttccatcc aagggagaat taagtagaga 1320 
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aatttgtctg caatctcaat ctaaagacaa atctacgaca ccaggaggaa caggaattaa 1380 

gcctttcctg gaacgctttg gagagcgttg tcaagaacat agcaaagaaa gtccagctcg 1440 

tagcacaccc cacagaaccc ccattattac tccaaataca aaggccatcc aagaaagatt 1500 

attcaagcaa gacacatctt catctactac ccatttagca caacagctca agcaggaacg 1560 

tcaaaaagaa ctagcatgtc ttcgtggccg atttgacaag ggcaatatat ggagtgcaga 1620 

aaaaggcgga aactcaaaaa gcaaacaact agaaaccaaa caggaaactc actgtcagag 1680 

cactcccctc aaaaaacacc aaggtgtttc aaaaactcag tcacttccag taacagaaaa 1740 

ggtgaccgaa aaccagatac cagccaaaaa ttctagtaca gaacctaaag aagtgatacg 1800 

tgaaattgag atgagtgtgg atgatgatga tatcaatagt tcgaaagtaa ttaatgacct 1860 

cttcagtgat gtcctagagg aaggtgaact agatatggag aagagccaag aggagatgga 1920 

tcaagcatta gcagaaagca gcgaagaaca ggaagatgca ctgaatatct cctcaatgtc 1980 

tttacttgca ccattggcac aaacagttgg tgtgagttta gtgtccacac ctagactgga 2040 

attgaaagac accagcagaa gtgatgaaag tccaaaacca ggaaaattcc aaagaactcg 2100 

tgtccctcga gctgaatctg gtgatagcct tggttctgaa gatcgtgatc ttctttacag 2160 

atctcaaaga ttcaaagaaa cagaacgtcc atcaataaag caggtgattg ttcggaagga 2220 

agatgttact tcaaaactgg atgaaaaaaa taatgccttt ccttgtcaag ttaatatcaa 2280 

acagaaaatg caggaactca ataacgaaat aaatatgcaa cagacagtga tctatcaagc 2340 

tagccaggct cttaactgct gtgttgatga agaacatgga aaagggtccc tagaagaagc 2400 

tgaagcagaa agacttcttc taattgcaac tgggaagaga acacttttga ttgatgaatt 246 0 

gaataaattg aagaacgaag gacctcagag gaagaattaa ggctagtccc cccaagtgaa 2520 

tttattgcca tcccaaagga tcagtttact ttgtcagaaa tccgcttgcc ctaaaaagca 2580 

gattttgtct gcagtacggt tcagaaacca gatgcagcaa attactatta cttaattata 2640 

cttaaaagca ggagctgaaa atatggtagc cacaccatta gcaagtactt caaactctct 2700 

taacggtgat gctctgacat tcactactac atttactctg caagatgtat ccaatgactt 2760 

tgaaataaat attgaagttt acagcttggt gcaaaagaaa gatccctcag gccttgataa 2820 

gaagaaaaaa acatccaagt ccaagaaaag caacattcat tcttcagtca tggccagtcc 288 0 

aggaggtctt agtgctgtgc gaaccagcaa cttcgccctt gttggatctt acacattatc 294 0 

attgtcttca gtaggaaata ctaagtttgt tctggacaag gtcccctttt tatcttcttt 3000 

ggaaggtcat atttatttaa aaataaaatg tcaagtgaat tccagtgttg aagaaagagg 3060 

ttttctaggc tgccctggag gagggcgact ccaaccaaag agacagacca fcatttgaaga 3120 

tgttagtggt tttggtgcct ggcatcgaag atggtgtgtt ctttctggaa actgtatatc 3180 

ttattggact tatccagatg atgagaaacg caagaatccc ataggaagga taaatctggc 3240 

taattgtacc agtcgtcaga tagaaccagc caacagagaa ttttgtgcaa gacgcaacac 3300 

ttttgaatta attactgtcc gaccacaaag agaagatgac cgagagactc ttgtcagcca 336 0 

atgcagggac acactctgtg tttaccaaga actggctgtc tgcagatact aaagaagagc 3420 

gggatctctg gatgcaaaaa ctcaatcaag ttctttgtga tattcgcctc tggcaacctg 3480 

atgcttgcta caaacctatt ggaaagcctt aaaccgggaa atttccatgc tatctagagg 3540 

tttttgatgt catcttaaga aacacactta agagcatcag atttactgat tgcattttat 3600 

gctttaagta cgaaagggtt tgtgccaata ttcactacgt attatgcagt atttatatct 3660 

tttgtatgta aaactttaac tgatttctgt cattcatcaa tgag 3704 



<210> 193 

<211> 2319 

<212> DNA 

<213> Homo sapiens 



<400> 193 

atgcaaataa atgagacaat ttgggatact gtgggagctg caagcaggca cggtgaaggt 60 

gaaaggcaag caaagagcag cacaaggggc tgcacacacc ttgctgaagg gcagggaatt 120 

tacctgcagg aggagcagag tccacctgag atgtgcacac gtgttatgga gaaaagggaa 180 

ggacttacaa ttgagagaga gagggatcca ctcctaccag tctggaaagc actgggcata 240 

caagcgcaca aatgcgtggc acatactaca aatccttcta aagccactgc tgtacacctg 300 

cctcacttaa ccatgcagcc tcaaggctgt ctcatgtcat tctttcctac agctgcagag 360 

ttttcaacat atggacagga attataccta gaaaataacc aaattgaaga aataactgaa 420 

atttgtttca atcataccag aaagatcaat gtcattgtac tacgttataa caaaattgaa 480 

gaaaatagga ttgctccttt agcctggata aatcaagaaa atctagaatc cattgatctc 540 

tcctacaaca agctctatca cgtcccgtcc tatctaccca agtccttgct gcacctagta 600 

ctccttggga accagattga acggatccct ggctatgtgt ttggccacat ggaaccaggc 660 

ctggaatact tgtacctgtc atttaacaaa cttgctgatg atggcatgga ccgtgtctcc 720 

ttctatgggg catatcattc tctgagagaa ttatttctgg atcacaatga cttaaaatct 780 

ataccacctg ggatacaaga aatgaaagca ctacattttc tgaggctgaa caacaacaag 840 
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atacgaggca ataagcagga aatcaaacag acaagtaagc aggcaagtgc tgtgcagtca 900 

gaaaaatggg tgacaatgcg acgagcacac tggggcctga gggccgccag gagactacgg 960 

ccaccctcaa cggcctggat taacagcaga tccagaccgg tgcctgtgga gcagacgcac 1020 

tgtgggcttg ctgtagcaga agagagaaaa gatctgttca tgtttttccg aagcctgcat 1080 

ttttttgtgg agtggtttga atatcgtaag cgcacgttta aacatctcaa gtgggatgaa 1140 

gactatgacc aagagccaga tgatgattac caaacaggat tcccatttcg tcaaaatgta 1200 

gactacggag ttccttttca tcagtatact ttaggctgtg tcagtgaatg cttctgtcca 1260 

actaactttc catcatcaat gtactgtgat aatcgcaaac tcaagactat cccaaatatt 132 0 

ccgatgcaca ttcagcaact ctaccttcag ttcaatgaaa ttgaggctgt gactgcaaat 1380 

tcattcatca atgcaactca tcttaaagaa attaacctca gccacaacaa aattaaatct 1440 

caaaagattg attatggtgt gtttgctaag cttccaaatc tactacaact tcatctagag 1500 

cataataatt tagaagaatt tccatttcct cttcctaaat ctctggaaag actccttctt 1560 

ggttacaatg aaatctccaa actgcagaca aatgctatgg atgggctagt aaacttgacc 1620 

atgcttgatc tctgttataa ttatcttcat gattctctgc taaaagacaa aatctttgcc 168 0 

aaaatggaaa aactaatgca gctcaacctc tgcagtaaca gattagaatc aatgcctcct 1740 

ggtttgcctt cttcacttat gtatctgtct ttagaaaata attcaatttc ttctataccc 1800 

gaaaaatact tcgacaaact tccaaaactt catactctaa gaatgtcaca caacaaacta 1860 

caagacatcc catataatat ttttaatctt cccaacattg tagaactcag tgttggacac 1920 

aacaaattga agcaagcatt ctatattcca agaaatttgg aacacctata cctacaaaat 1980 

aatgaaatag aaaagatgaa tcttacagtg atgtgtcctt ctattgaccc actacattac 2040 

caccatttaa catacattcg tgtggaccaa aataaactaa aagaaccaat aagctcatac 2100 

atcttcttct gcttccctca tatacacact atttattatg gtgaacaacg aagcactaat 2160 

ggtcaaacaa tacaactaaa gacacaagtt ttcaggagat ttccagatga tgatgatgaa 2220 

agtgaagatc acgatgatcc tgacaatgct catgagagcc cagaacaaga aggagcagaa 2280 

gggcactttg accttcatta ttatgaaaat caagaatag 2319 



<210> 194 
<211> 948 
<212> DNA 
<213> Homo sapiens 



<400> 194 

atgcctcgcc cccagcgcgc gctcccggcg ccggccccgc cccctcaccc aggcggcgcc 60 

gtaaagccgg agaaggggcg gggtctcagc tcctacttca ttctacggcc gagaccggag 120 

gatgttccct gctcaggagg aggccgacag gaccgtgttt gttgggaatt tagaggcccg 180 

agttcgggaa gagattctgt acgagctgtt ccttcaggtg ctctgtccaa gggagatggg 240 

aattttatct ataagcccct gactggggct gctgcctttc tttcagagat gccctgctca 300 

gacaggagga atctagagag gcaatcaggc tacagtgact ttgtagccct gagctttgtt 360 

tacactgtga ggggaaaacc acctactcaa gcctcagtaa tggcagaagc tcctcctccc 420 

accaagctta agcatcccag gtcaacttct gactgctgtg ctggcagtga gaatttcaag 480 

ccagtggatc ttagcttgct gggctccgtg gaagagggat ctgctgagca agaccacttg 540 

gctccctggc ttcagccccc tttccagggg agggaacagt tctgtctcgt tggtgttcca 600 

ggcgccactg gtgtacggaa aaaaaaaaaa caactcctgc agctaactgg gtgtctgccc 660 

aaacagctgc ccagatacct cagctgccac gtccagactc ttcccactct cccacctgca 720 

agagaaacat cctcaggcca aagaatgatg catgccacag ggcagtctgt ccatgagaga 780 

acaaacctga ggaatgagaa cccaggtccc aggaacaagc ttccatcatt tgagcagaga 840 

attctgggac ctgagtacct agatggtcaa gaagcaggaa atggggagga cactcctagg 900 

cagagagcca tgcagggaag gctagagtgg gaactgacta aagcatga 948 



<210> 195 
<211> 892 
<212> DNA 
<213> Homo sapiens 



<400> 195 

atgagtaggt ggggagcggc ggtggggcag ggcgcactgc gagaagagca cttcgcgcat 60 

gcgcatatca ccgagcgaac gcgccgtgtt cgggaaggta ggagaaagag gcggagctcc 120 

ttgttaacga catccccaac ctctgctaac gcgcaggcgc acttcctgaa gctgaaggtc 180 
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agcatagaca aagggcctca gaatcgcgca ggcgcaattg tgccctggtt cgccaagatg 240 

tcgttcccaa agtataagcc gtgcgagcct gcgcactctg ccgtgagacc ctcgacccag 300 

ccgaatacaa catatctccg gaaacccggc gggcgcaagc cggagcggtt ggcccatagg 360 

agcccagctg caaacgagag tacctgcttc agtacaacga tcccaacacg ccgagggctc 420 

atcgaaaatc cgtgccttgc ttcgttgggc ctatgcaaga acaataaatg tttattccta 480 

atttcaagcc ccactcctaa aagctcactc atgggagctt ttgtgtggga ttttgggccc 540 

ctcatcttca ttttattata ttatcaaaac tgagaggtgg gacccaaatc agagatggct 600 

cactgactca agaattttaa aatatgaagc tatcttacta gagagagatg acctgacact 660 

aaccactgac aattcactca accctgctgc tttcctaaga ggaaatccaa acccggaaga 720 

acctgagcat aaatgcttag atctaatcag ttatcaaact agagtcagac tggatctaag 780 

caaaaccccc ttccaaacag ggcgtcacct ctttatagac ggttcctccc tggtcattgg 840 

aggaaaaggg cacaacgggt actctgtagt tgatggggaa acccttacaa ag 892 



<210> 196 

<211> 597 

<212> DNA 

<213> Homo sapiens 



<400> 196 

atgcagactt ttacaacatg catcagctac agtgagtatt cttgcatgct tttggccaat 60 

gcaagctctc atggcacgct gtactgtaag ctccgggtcg gcatttgttt gttgatggta 120 

ccagctgtca aaaaccaagc aagtggcagt gctagaggtg ccacaaaagt gcgaaggaag 180 

tgtcaggctg gatgtcaaaa tgaacacctt ggagaactgg atgatggaac agacggtaaa 240 

aatcagctaa acatcagaga aaatggagga agaggtcaaa actgtgaaca ggaactagaa 300 

gaaagtgtag cagaaaaaga cttgtcacaa acttcgagag atttggagaa aatgatgtca 360 

aaacacatct tcctcaagcc catgctgagt atctctgatt tggttaattt cttgatgcaa 420 

gtttccaaag ttctggtgaa aacggctgaa ggaatagtgt tacagcaact gcccttggct 480 

tttccagctt tacacttcca tgcctatgga aatctcttcc ctgtctgtag ctttaagcat 540 

tacatctata tgatcgacca tcccatcttc atctccatcc ctgacttcct aacttga 597 



<210> 197 

<211> 4442 

<212> DNA 

<213> Homo sapiens 



<400> 197 
atgccagttc ccagcaggca 
tggtacgagg gcccctggga 
ctgacgtatg gcacagcaga 
gccctgcccg tggccgccga 
gcaagaaagg ggtctactcc 
ggtcgcagcc ccctgctgcc 
cccaaagagc ctcccctcta 
gtgcttggca gcagaacgtc 
gccggtcacc tgtaccggga 
agcagggctg catctcccgg 
gaggtgcctg cctaccccct 
cctgcccctg ccagacaggt 
gcccccgaag gcagcacagg 
cgggaaagca tcccatccaa 
cagggtcctg gtggcaagag 
gggctggcag aggccacgct 
ctggctaccc tggaggacgc 
gttctgagcc gcctggccaa 
gggggcccgc tgccccgggc 
ggtgacttgg cttctctggg 
gacccggctc gccgtccctc 
tattctgccc ctggcgtggg 



tataaatatt ggtcgttctc agagctggga tgctgctggg 60 

gaacgccgag tccctgcggc ctctggggag gagaagctcc 120 

gggg a <3ttgg tttgagccaa accaccgacc acaggacgct 180 

gccctacctg taccgggagg ccgtttataa ctcagtggct 240 

tgacttcacc ttctacgaca gcagacaggc agtgatgtct 300 

a cgggagtac tacagtgatc cgtctggagc tgctagggta 360 

tcgggaccca ggagtcagcc ggccggtccc cagctacgga 420 

atgggatcca atgcaaggcc ggtcacctgc cctgcaggac 480 

tcctggaggt aaaatgatcc ctcaggggcg gcagacacag 540 

gcggtatgga cgggagcagc ccgacaccag gtatggggcg 600 

cagccaggtc ttcagcgaca tcagcgaaag acccattgac 660 

ggccccgacg tgcctggttg tggaccccag ttcagctgct 720 

ggtggcccca ggggccctga atcgtggcta cgggcctgcc 780 

gatggcttac gagacttacg aagctgacct gtccaccttc 840 

gaccgtgctc cctgagttcc tggcctttct gcgggcggag 900 

ggg a gcccta ctgcagcagg gctttgactc cccggccgtc 960 

ggatatcaag tctgtcgcac ccaacctggg ccaggcccgt 1020 

tagctgcagg accgaaatgc agctgcggag gcaggaccgg 1080 

gcggtccagc agcttcagcc accgaagcga actgctccat 1140 

cgctgcggct cctctgcaga cagcatcccc ccgagctgga 1200 

cagcgcacca tctcagcacc tcctggagac ggcagccacc 1260 

cacccatgcc ccacacttcc cctccaactc cgggtacagc 1320 
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tctcccaccc cttgcgccct gacagcgcgt ctgagcccca cgtaccccct gcaggcaggg 1380 

gtggccttga ctaacccggg cccctcaaac ccccttcacc caggccccag aacagcctac 1440 

tccacggcat acacggtgcc catggagctg ctaaagaggg agcgcaacgt ggccgcctct 1500 

ccgctgccca gccctcacgg cagcccccag gtcttgcgga agccaggtgc acccctgggg 1560 

ccatccaccc tgccgccggc cagccagagc ctccacacgc ctcactcacc gtaccagaag 162 0 

gtggcccggc gcacaggggc acccatcatc gtgtccacca tgcttgcacc agaaccaatc 1680 

cagtttgctg gccaagcagt ccagagtgac aacgtgagga aggcgtatgc tgcggggacc 1740 

ccagtgcggc ccaccagccc cggtgacaca gacaaatggg ggctccaggc ccgcgctcct 18 00 

ggacgtgccg tagaccccag gaatatgatt tcagcccagg agcataaagt agttgaatgc 1860 

atggcccgaa ggagtgccac cfcgctttgtg tttgggcaac tctgccgcct tcacagcact 1920 

tcctctgacc ccgtgggtgt ggatttcatt ctgtccatgg aggacgtggg aaggggcaag 1980 

agtaggaacc ctgattcctg gtcccccaat gcagttgtgt gggatgccag tggcgttggc 2040 

gg^gagcgcg tcctgcaata ccagctagat atgaacacgg tgcctccgca gggctggacc 2100 

actcgaaaaa cgagagtgtg ctgcaagcat gaggcttctc cctctcccat ctcagccttg 2160 

gctgccatag ctaaggagga gggtgttatt ttgcttttgt ggacattcac cctgggaaat 2220 

aagaggcttg gaggatcggc cacacgagtg gggtatgcag aggcacaggc agaggctccc 2280 

tcctgcaagg ccactactgt gactctgagc agcggttctt ctcatgaatg tgatagctcg 2340 

gtcagctcta agactgcaac ctgcagggac ttcatggggc aaccctgggg gcacgcaagc 2400 

atcccaccca ctcccaaccc cccacctcca gccgtcgtgc ccggcatctt ttcccagcat 246 0 

gagaacccgt tggcttttct cttttcccgc cttgccatga aggacctgct gccaggattc 2520 

gagcctcaga ccttggacag gtctcgggcc agcctcagcc acgttctgcg tgcccggcca 2580 

tcagggaggg tagagggtat ccgcccccag atcatgaacg gccccctgca cccccgcccc 264 0 

ctggtggcgc tgctggacgg ccgcgactgc actgtggaga tgcccatcct gaaggacctg 2700 

gccactgtgg ccttctgtga cgcgcagtcg acgcaggaaa tccacgagaa ggttctaaac 2760 

gaagccgtgg gcgccatgat gtaccacacc atcaccctca ccagggagga cctggagaag 2820 

ttcaaggccc tgagagtgat cgtgcggata ggcagtggct atgacaacgt ggacatcaag 2880 

gctgccggcg agctcggtga gtgcgaggca gcgctggctg cctggagttg tccggagctc 2940 

tgtgggccct gctcgggtgg gcttggagaa gcagctggaa ctgggaccac ggagcagccg 3000 

cttctggctg tagcaaggtg gctgcctcca ggccgggctg tggagcacct tgcggccctg 3060 

ccttcacatg acacaggaat tgccgtgtgc aacatcccgt ctgcagccgt ggaagagaca 312 0 

gcggactcta ccatctgcca catcctcaac ctgtaccgga ggaacacgtg gctgtaccag 3180 

gcactgcggg aaggcacgcg ggttcagagc gtggagcaga tccgcgaggt ggcctcggga 3240 

gcggcccgca tccgtgggga gacgctgggc ctcattggct ttggtcgcac ggggcaggcg 33 00 

gttgcagttc gagccaaggc ctttggattc agcgtcatat tttatgaccc ctacttgcag 3360 

gatgggatcg agcggtccct gggcgtgcag agggtctaca ccctgcagga tttgctgtat 3420 

cagagcgact gcgtctcctt gcactgcaat cttcaacgaa cataaccacc accttcatca 3480 

atgactttac cataaagcag atgagggcag ggagcattcc tttgtggaac gcagcccgtg 3540 

gcggcctggt ggacgagaaa gccttagcac aagccctcaa ggagggcagg atacgagggg 3600 

cagccctcga cgtgcatgag tcagagccct tcagctttgc tcagggtccg ttgaaagatg 3660 

ccccgaatct catctgcact cctcacactg cctggtacag tgagcaggcg tcactggaga 3720 

tgagggaggc agctgccacc gagatccgcc gagccatcac aggtcgcatc ccagaaagct 3780 

taagaaattg tgtgaacaag gaattctttg tcacatcagc gccttggtca gtaatagacc 3840 

agcaagcaat tcatcctgag ctcaatggtg ccacatacag atatccgcca ggcatcgtgg 3900 

gtgtggctcc aggaggactt cctgcagcca tggaagggat catccctgga ggcatcccag 3960 

tgactcacaa cctcccgaca gttggcacat ccttcccaag cgccctctcc caaccagccc 4020 

acaaaacacg gggacaatcg agagcacccc aacgagcaat agcagagaat gccagaaggt 4080 

aatcactcag atacacttgg gaccaagaga cagtgaaaaa tagatgaact aagagaaaaa 4140 

gaatcggatg gtctttgtaa ctgattctgg acatatgcat cattgatgtt gcagtgttga 4200 

aactacaaga gctagaaaac tgaagatgtc gtcttgctta cggaaggcgc tggaaagact 4260 

agggtgtgga tttatttaac gaccaacttc tgttattgtg tgttaagttt ttcatctgtg 4320 

catcaaatca caaaaagaat aaatagagct ttttccttta tcagtccctt ggggcacagc 4380 

cgggtcctga acaccctgct ctacaatgtt gcatcaagag ttcaaacaac aaaataaaaa 4440 

at 4442 



<210> 198 
<:211> 573 
<212> DMA 
<213> Homo sapiens 

<400> 198 

ccgaatttcc gtgtcgacga tctcgtagct tcttaagata tcaggaatta ttcttaaaac 60 
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gggagagagt cagaatcaac tagctgtgga tcaaatagct ttccaaaaga agctctttca 120 

gaccctgagg agacaccctt cctatcccaa aataatagaa gaatttgtta gtggcctgga 180 

gtcttacatt gaggatgaag acagtttcag gaactgcctt ttgtcttgtg agcgtctgca 240 

ggatgaggaa gccagtatgg gtgcatctta ttctaagagt ctcatcaaac tgcttctggg 300 

gattgacata ctgcagcctg ccattatcaa aaccttattt gagaagttgc cagaatattt 360 

ttttgaaaac aagaacagtg atgaaatcaa catacctcga ctcattgtca gtcaactaaa 420 

atggcttgac agagttgtgg atggcaagga cctcaccacc aagatcatgc agctgatcag 480 

tattgctcca gagaacctgc agcatgacat catcaccagc ctacctgaga tcctagggga 540 

ttcccagcac gctgatgtgg ggaaagaact cag 573 



<210> 199 

<211> 6286 

<212> DNA 

<213> Homo sapiens 



<400> 199 

gacgcgccgg agccggtttt ccagataaca gaaagtaacg tgaaggaatt caggtgactc 60 

agacatggag gagagaagac ctcatctgga tgccaggccc aggaattccc ataccaacca 120 

cagaggccct gtggatggag agttaccacc aagagctaga aatcaggcca ataacccacc 180 

agccaatgct ctccgaggag gagccagcca ccctggaagg catccfcaggg ccaacaacca 240 

tcctgctgct tactggcaga gggaagagag atttagggcc atgggcagga acccacatca 300 

aggaaggagg aaccaggagg ggcatgccag cgacgaagct agagaccaaa gacatgacca 360 

ggagaatgac accaggtgga gaaatggcaa ccaggactgt aggaaccgca gaccaccatg 420 

gtccaatgac aacttccagc agtggcggac tccccaccag aagcctacag aacagccaca 480 

gcaggcgaag aaactgggct acaagttctt agaaagtctt ctgcagaaag acccttctga 540 

ggtggtcatc acacttgcca caagtttagg gctgaaagag ctcctttctc attcttccat 600 

gaaatctaac ttccttgagc tcatctgtca ggttcttcgg aaggcttgta gctccaaaat 660 

ggatcgccag agtgttctcc atgtactggg catattgaaa aactccaaat ttctcaaagt 720 

ctgcctgcct gcttatgtgg tagggatgat cactgaaccc atccctgaca tccgaaacca 780 

gtatccagag cacataagca acatcatctc cctcctccag gaccttgtaa gtgtcttccc 840 

tgccagctct gtgcaggaaa cttccatgct ggtttccctc ctgccaacct ctcttaatgc 900 

tctgagagcc tctggtgttg acatagaaga ggaaacggag aagaacctgg aaaaggtaca 960 

gatctatcat tgaccatctg caggaaaaga ggcgagaggt cactttgaga gtggatacgt 1020 

acactctagt gcagcctgag gcagaagacc atgttgagag ctaccgatcc atgcccattt 1080 

accctacgta caatgaagtg cacttggatg agaggccctt ccttcgcccc aatatcattt 1140 

ctggaaaata cgacagcact gctatctatc tggataccca cttccggctc ctgcgagaag 1200 

aaatcgtcag acctttacgg gaaggtattt tggaacttct ccaaagcttt gaagaccagg 1260 

gcctgaggaa gagaaagttt gatgacatcc gaatctactt tgacaccagg attatcaccc 1320 

ccatgtgttc atcatcaggc atagtctaca aggtgcagtt tgacacaaaa ccactgaagt 1380 

ttgttcgctg gcagaattcc aaacgattgc tctatgggtc tttggtatgc atgtccaagg 1440 

acaacttcga gacatttctt tttgccaccg tatctaacag ggagcaggaa gatctctgcc 1500 

gaggaattgt ccagctctgc ttcaatgagc aaagccaaca gctgctagca gaggtccagc 1560 

cctctgactc tttcctcatg gtagagacaa ctgcatactt tgaggcctac aggcacgtcc 1620 

tggaaggact ccaggaggtc caggaggaag atgttccctt ccagaggaat atcgtggagt 1680 

gtaactctca tgtgaaggag ccaaggtact tgctaatggg gggcagatac gactttaccc 1740 

ccttaataga gaatccttca gccactgggg aatttctaag aaatgtcgag ggtttgagac 1800 

atcccagaat taatgtctta gatcctggcc agtggccctc aaaagaagcc ctgaagctgg 1860 

atgactccca gatggaagcc ttgcagtttg ctctcacaag ggaactggct attattcaag 1920 

gacctcctgg gaacaggcaa aacctatgtg ggtctaaaaa ttgttcaggc cctcctaacc 1980 

aacgagtctg tttggcaaat tagcctccag aagttcccca tcttggttgt gtgttatact 2040 

aatcatgctt tgggaccagt ttctgggaag gcatctacaa ttgtcaggaa gaccaggcat 2100 

tgtgcgggtg gggtggaagg gagcaacagt ggaaattcct gaaggcagtt tcaccctaag 2160 

ggagctgagg aacaagcggg aattccgccg caacctcccc atgcacctcc gaagggccta 2220 

catgagtatc atgacacaga tgaaggagtc agagcaagag cttcatgaag gagccaagac 2280 

cctggagtgc accatgcgtg gtgtcctacg ggaacagtac ctgcagaagt acatctcacc 2340 

cccagcactg ggaaagtctc atgaatggcc ccagtgcagg atagtgaatg gattttcttc 2400 

ccagcactgg aagcattccc atgatgctgg agtggcttag gtcttggtgt cggttctttc 2460 

acgcaaagtg tttctccagc aggacctgag aatacagccc caggcagaag gggatgagga 2520 

ggaagaaggg gaggaggaga gtttcgcttg attagagatt cgcagaggga agctgaccct 2580 

gatttcaagc agaccgggtg attgaggagg aagaggtggt gaggccccag cggcggaaga 2640 

aggaagagag tggagcagac caggagttgg ctaaaatgct tctggccatg aggctagacc 2700 
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attgtggcac tgggacagca gctggacagg agcaagccac aggagagtgg cagacccagc 2760 

gcaaccagaa aaagaaaatg aaaaaaagag tgaaggatga gcttcgcaaa ctgaacacca 2820 

tgactgcagc cgaggccaac gagatcgagg atgtttggca gctggacctc agttctcgct 2880 

ggcagcttta taggctctgg ctacagttgt accaggctga cacccgccgg aagatcctca 2940 

gctatgaacg ccagtaccgc acatcagcag aaagaatggc cgagctgaga ctccaggaag 3000 

acctgcacat tcttaaagat gcccaggttg taggaatgac aaccacaggt gctgccaaat 3060 

accgccagat cctacagaag gtggagccga ggattgtcat agtggaagaa gctgcggaag 3120 

tccttgaggc ccataccatt gccacattga gcaaagcttg ccagcacctc attttgattg 318 0 

gggaccacca gcagctgcgc cccagtgcca acgtgtatga tctggccaag aacttcaacc 3240 

ttgaggtgtc cctttttgaa cggctagtga aagtaaacat tccctttgtc cgtctgaatt 3300 

accagcaccg tatgtgccct gaaattgccc gccttttgac cccccacatt taccaggatc 3360 

tggagaatca tccatctgtt cttaagtatg agaagattaa gggggtgtct tccaaccttt 3420 

tctttgtaga acacaacttt cctgaacagg aaatccaaga ggcgcaaaag ccatcagaac 3480 

cagcatgagg ctcacaatgt ggtagagctg tgcaagtact tcctgtgcca ggaatacctg 3540 

ccttcccaga tcaccatcct cactacctat accgggcagc tcttctgcct gcgcaaactg 3600 

atgcctgcca agacatttgc tggcgtcagg gtccatgttg tggacaaata ccaaggggaa 3660 

gagaatgaca tcatcctcct ctcgctagtg cggagcaacc aagaaggcaa ggtgggtttt 3720 

ctgcagatat ccaaccgcat ctgtgtggcc ttgtcccgag ccaagaaggg aatgtactgc 3780 

atcggaaaca tgcagatgct ggccaaggtg cccctgtgga gcaagatcat tcatacactt 3840 

cgagagaaca atcaaatagg ccccatgctc cggctctgct gccagaacca ccctgaaacc 3900 

cacaccttag tatccaaagc ttctgacttc caaaaagtac ccgaaggagg ctgcagcctg 3960 

ccctgcgagt tccgcctggg ctgtgggcat gtctgcaccc gtgcctgcca cccttatgac 4020 

tcttcacaca aggagttcca atgcatgaag ccatgccaga aggtcatctg tcaggaaggg 4080 

caccggtgtc cccttgtttg cttccaggag tgtcagcctt gtcaggtgaa ggtgcccaaa 4140 

accattcctc ggtgcggcca tgaacaaatg gtcccttgtt ccgtgcctga gtcagatttc 4200 

tgctgccagg agccttgctc caagtctctg agatgtgggc acagatgcag ccacccatgt 4260 

ggtgaggact gtgtgcagct gtgttcagaa atggtcacca taaaactcaa gtgtgggcac 4320 

agtcaaccgg taaaatgtgg tcatgtggaa ggcctcctgt atggtggtct gctagtcaag 4380 

tgtaccacaa agtgtggcac tatcttggac tgcgggcatc cttgcccagg ctcctgccac 4440 

agctgcttcg aagggcgttt ccatgaacgc tgtcagcagc cctgcaagcg cctgcttatc 4500 

tgctcacaca agtgcccaga aaccatgcat tggtgagtgc ccaccctgcc agcggacctg 4560 

tcagaaccgc tgtgtccaca gccagtgcaa gaagaaatgt gaggagctgt gtagtccctg 462 0 

cgtggaaccc atgtgctcgc gctgccagca ctaccagtgc accaaactct gctctgagcc 4680 

ctgcaaccga cccccatgct atgtgccttg tactaagctg ctagtttgtg gccacccctg 474 0 

cattggtctc tgtggggagc catgccccaa gaaatgccgg atctgccaca tggatgaggt 4800 

cacccaaata ttctttggct ttgaggatga gcctgatgcc cgctttgtgc agctggaaga 4860 

ctgcagccac aatttttgag gtgcaagccc tagaccgcta catgaatgaa cagaaggatg 4920 

atgaagtcgc catcagattg aaagtctgcc ctatctgcca ggtgcccatc cgcaaaaacc 4980 

tgagttatgg aactagcata aaacagcggc tagaggagat tgaaafccatc gaggaaaagt 5040 

atccagggct catcagggga aatggcaacc agccaggaac ggcttaaggc cctgctggag 5100 

aggaagagcc tcctccacca gctgcttcct gaagacttcc tgatgttaaa ggagaagctg 5160 

gcccagaaaa atctgtcagt gaaggacctg ggtctggttg agaattacat cagcttctat 522 0 

gaccacctgg ccagcctgtg ggattccctg aaaaagatgc atgtcttaga agagaaaaga 5280 

gtgaggactc gactagaaca ggtccatgag tggctggcca agaagcgctt gagcttcact 5340 

agccaggaac taagtgacct ccgaagtgaa atccagaggc tcacatacct ggtgaacctt 5400 

ctgacccgct acaagatagc agagaagaag gtgaaagata gcatagcagt agaggtctat 5460 

agtgtccaga atatccttga gaaaacatgt aagttcaccc aagaggatga acaacttgtg 5520 

caggaaaaga tggaagctct gaaagccacc cttccctgct ctggcctggg catctcagag 5580 

gaagagcgag tgcagattgt cagtgccata ggttatcctc gtggtcactg gttcaagtgc 5640 

cgcaatggcc atatctatgt gattggcgat tgtgggggag ccatggagag gggcacgtgt 5700 

cctgactgta aggaagtgat tggtggcaca aatcatactc tggaaagaag caaccagctt 5760 

gcttctgaaa tggatggagc ccagcatgct gcctggtctg acacggccaa caacctgatg 5820 

aactttgagg agatccaggg gatgatgtag gaagatggta caccactgcc ttttgccctc 5880 

gccactgaat gactggggcc agctccctaa tgaaggaact gaagtttgtt ttttattatc 5940 

atccttttta ggctgggcgc agtggcttac gcctgtaatc ccagcacttt gggaggccga 6000 

ggcaggcgga tcacctgagg tcaggagttc gagaccaagc ctgaccaaca tggcgaaacc 6060 

ccgtctctac taaaaataca aaaataaaat aaaataaaat caaaataagc caggcgtggt 6120 

ggcgtgcgcc tgtaatccca gctactccgg gaggctgagg caggagagtc acttgaaccg 6180 

gggaggcgga ggttacagtg agccgagatc atgccattgc attccagcct gggcgacagg 6240 

agcaagactc tgtctcaaaa ataaataaat aaataaataa ataaaa 6286 



<210> 200 
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<211> 2040 

<2X2> DNA 

<213> Homo sapiens 



<400> 200 

gcgagaactc atcctgtagt caccagatgg agtcccaaac agccaagcag atgtaaggcc 60 

tgtgctgtgg ctctgaggcc ctgaatacag aagggtcact ttcttagtgg ccaaagagca 120 

gttgttgaca ttgatgtcta attattgaac acgaccagtc attttactga gctgcggtga 180 

ggaaacactg accatagaag atcaagccaa atgagggatt gcaaatttcc tgattctttt 240 

gaattaggat tccagatggg ggcctcattt ctacagcccc caacattcct atagccgtta 300 

tcactgccat caccactgcc accagcatct tcttgcagat tccacccctg ctccccagag 360 

acttcctgct ttgaaagtga gcagaaagga agctctcaga aaaatctcta gtggtggctg 420 

ccgtcgctcc agacaatcgg aatcctgcct tcaccaccat gggctggctt . tttctaaagg 480 

ttttgttggc gggagtgagt ttctcaggat ttctttatcc tcttgtggat ttttgcatca 540 

gtgggaaaac aagaggacag aagccaaact ttgtgattat tttggccgat gacatggggt 600 

ggggtgacct gggagcaaac tgggcagaaa caaaggacac tgccaacctt gataagatgg 660 

cttcggaggg aatgaggttt gtggatttcc atgcagctgc ctccacctgc tcaccctccc 720 

gggcttcctt gctcaccggc cggcttggcc ttcgcaatgg agtcacacgc aactttgcag 780 

tcacttctgt gggaggcctt ccgctcaacg agaccacctt ggcagaggtg ctgcagcagg 840 

cgggttacgt cactgggata ataggcaaat ggcatcttgg acaccacggc tcttatcacc 900 

ccaacttccg tggttttgat tactactttg gaatcccata tagccatgat atgggctgta 960 

ctgatactcc aggctacaac caccctcctt gtccagcgtg tccacagggt gatggaccat 1020 

caaggaacct tcaaagagac tgttacactg acgtggccct ccctctttat gaaaacctca 1080 

acattgtgga gcagccggtg aacttgagca gccttgccca gaagtatgct gagaaagcaa 1140 

cccagttcat ccagcgtgca agcaccagcg ggaggccctt cctgctctat gtggctctgg 1200 

cccacatgca cgtgccctta cctgtgactc aactaccagc agcgccacgg ggcagaaaag 1260 

cctgtatggt gcagggctct gggagatgga cagtctggtg ggccagatca aggacaaagt 1320 

tgaccacaca gtgaaggaaa acacattcct ctggtttaca ggagacaatg gcccgtgggc 1380 

tcagaagtgt gagctagcgg gcagtgtggg tcccttcact ggattttggc aaactcgtca 1440 

agggggaagt ccagccaagc agacgacctg ggaaggaggg caccgggtcc cagcactggc 1500 

ttactggcct ggcagagttc cagttaatgt caccagcact gccttgttaa gcgtgctggg 1560 

acatttttcc aactgtggta gccctggccc aggccagctt tacctcaagg acggcgcttt 162 0 

gatggtgtgg acgtctccga ggtgctcttt ggccggtcac agcctggggc acagggtgct 1680 

gtttccaccc caacagcggg gcagctggga gattttggag ccctgcagac tgtccgcctg 1740 

gagcgttaca aggccttcta cattaccggt ggagccaggg cgtgtgatgg gagcacgggg 1800 

cctgagctgc agcataagtt tcctctgatt ttcaacctgg aagacgatac cgcagaagct 1860 

gtgcccctag aaagaggtgg tgcggagtac caggctgtgc tgcccgaggt cagaaaggtt 1920 

cttgcagacg tcctccaaga cattgccaac gacaacatct ccagcccaga ttacactcag 1980 

gacccttcag taactccctg ctgtaatccc taccaaattg cctgccgctg tcaagccgca 2040 



<210> 201 
<211> 1505 
<212> DNA 

<213> Homo sapiens 



<:400> 201 

cccgttgcta cgacgatttc gcagcccctg agtctggagg ctgacatgtg gagcataggc 60 

gtcatcacct acatcctctt aagtggagca tcccctttcc tgggagacac gaagcaggaa 120 

acactggcaa atatcacagc agtgagttac gactttgatg aggaattctt cagcgcagac 180 

gagcgagctg gcccaggact ttattcggaa gcttctgggt taagagaccc ggaaacgggt 240 

cacaatccaa gaggctctca gacacccctg gatcacgtca aaaggagaag gcagagcccc 300 

tgaacagcgg aagacagagc ccacccagct gaagaccaag cacctgagag agtacactct 360 

caaatgccac tcaagcatgc ctcccaacaa ctgctatgtc aacttcgagc gttttgcctg 420 

tgtggtggaa gacgtggctc gggtcgacct gggatgccgt gccctggtgg aggcccacga 480 

taccatccag gacgatgtgg aggccctggt ctccatcttc aatgagaagg aagcttggta 540 

ccgagacgag aatgagagcg cccggcatga cctgtcccag ctcaggtatg agttccgcaa 600 

ggtggagtca ttgaagaagc tgctgcgaga agacatccag gccacgggat gtagcctcgg 660 

aagcatggcc aggaaactgg accatctgca agcccagttt gagattctga ggcaggagct 720 

ctccgcagac ctgcagtgga ttcaggagct ggtgggcagc ttccagctgg agagtgggag 780 

ctctgagggc ctgggctcca cattctacca ggacaccagt gaatccctgt cagagctgct 840 



120 



WO 01/74836 



PCT/US01/10472 



cagcagatca tgcactgagg aatttctggc aggctggaag ctctaatcgc cagaatccag 900 

tcagtgatgc atcccaagag aaagggtata ctcgccagca gttctgccac gagagtacac 960 

cgaacaaaag agacagggtc atttataatc tcacgcgtcc accctactgc tgtgtccagt 102 0 

ttccactggc tggaacggga cctgacattc tgtatttgtc ccgactggct agcaactcag 1080 

aactttttaa aagaggcaaa ggcagaggag aatcaaggaa gcaggaagtt aacttgtgga 1140 

atgctgagaa aggtaaaaaa aacttttaaa taaggaagag gaacaggcta tgacctaatg 1200 

cttgcttgga ccagtataag catgccaggg caaatattta ggctaaattg tgggagctaa 1260 

gaacataaag cacattgatt tctttattac ggctggcaga tatttaaaaa tgttagcacg 1320 

ggtctttgaa tcaattttgc ttctaagaga agctactatt tattcttaat tagatgggga 1380 

ggaaagtctt tgaaaaggaa cctctacttt actttttagt tactatgagg atcactatat 1440 

caatgtttct taattcatat gctaaaggaa ttccaccgca ctggctagga tctcgatgtg 1500 

ccgcg 1505 



<210> 202 
<211> 346 
<212> DNA 

<213> Homo sapiens 
<400> 20*2 

gggaaaagaa ggaggaaatg gtatgtccaa agggtcatgg agggcacaga gctctcaaat 60 
gatgagcagg tgatggagtt gtaataggag cctgaagagg cagactatag ctaaaaacca 120 
tgtgaggact atgacctcac cttgagagca tgggaaatac ccttgaactg ctgaaagcag 180 
gtacctatgg tgccagtttt cagtgtggag aaggatggag aggagctcgg tagcttcagg 24 0 
ccacgctggg ccgattggct gactggatta ctggagtggg tttcagtaga gagcctttct 300 
atctattgca tttctcaacc tgtttatatg tgggtggaat aaggct 346 



<210> 203 
<211> 207 
<212> DNA 

<213> Homo sapiens 



<400> 203 

cggccgcgtc gacgcccaac taaagtgtgc tgcgttctct gggtcctgag gcggcagtct 60 

ggccgcagtg accagactaa ctgtgccccg gaggctgctt ggaattccac ttaacgctgg 120 

aagcacgcag acgctcttga ggcgggcagc tgcgtagcgt gtggcggagc cgggagagcg 180 

ccccccaatg tggcacctga gcgtctg 207 



<=210> 204 
<211> 2847 
<212> DNA 

<213> Homo sapiens 



<400> 204 

ttaaaacgag gacttggaca aggagggaac cactggagtg acgtgatgca agccctcttc 60 

ggtcactctt ggcttccacc aggggtctct gcctccttga tgagaagaga agctggctgg 120 

ctccatgatg acctcgatgg cgtacaccct aacatcctca ttcaccagac ccagggtcag 180 

ctttaggaag ctcatctggc ggctggctac cagctcctct gtctccttgt cgtgggtagg 240 

caggtgcagg tggtggcaga aggtgtagag gaaggccttg gagcagagct gggtgagcat 300 

gctctggaca tgcaggtagt aggtgctgcc ccgcttgatc tgtgtgcggt ggtctgccag 360 

gcggggcacc agggtgcctc tgtaaagggg gcagcgcagg gttttgttgt caaggtccag 420 

gatgctcgtg tagcggctgt agcggctcag ggcatggagg gtaccggcac cggcagggga 480 

ggccactctc tgcaagccaa taagcttcca cttctgaaaa tccataatgt gcaaagggga 540 

ggaggcccaa tgtttcacgg gcttagcgta gcagccatag ctggggacaa agatagccag 600 

tgcagtcacg agtttcctga ctatggcctc atcttccccc aggaccacaa gtgtcctccc 660 

actgagcagg gagtagatgg ctgggtgggc aaaggggtac tggcggatga attttaaggc 720 
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gttctggcca gccctctttt tatgcctatc ggaagacagg ccagcaggat aagcagaggg 780 

gatcctgtct gagctggtgg acgctacact gctcacgtag ctggtggtgt ctgagtagtt 840 

gtccaggctg gtcttactga tgtcccggct agccagcagg tggctcgggt ccagctcata 900 

agcgggaaac ccttcacaac tgttgtctcg ggaagaaggg ttggcatttt ccactgaaaa 960 

gtctacacgg aacccctggt cgttctgcct gtggcgctgt ggggggctgc tcaccacacc 1020 

atcctcgtca gagtgtgtgt gggctggagt ggagggcagc accaactgac catcgctctc 1080 

cttcccaata cagcagcagg agtctatttg tgatggggtg tattctatgc tgccttcctc 1140 

tgtctcaccc agttctggag gactgatgct tgcctggaag tggatggctc gttcattgcc 1200 

atctggatat gattcttctt cattaatggc acttgggtag ctggccttaa agtcatcagg 1260 

gatgagggcc tcagaggggc aggtactgag gacttcaatc ctgtcctcgc tgatggtcct 1320 

gtgactgact gctttgctcc ggaccacctg ggggctcaat ggtactgtaa ggcttgcctg 1380 

gctgtcagat ttagaaagca cggaggtgga tttctccgtg cccaaacctt caatactttc 1440 

accactgctg atactccctt tcatatccat atccaggtag tccaagtttt cctgggggtc 1500 

gaaactgctc aactcagtca cttccacttc cttgtactcc tcatctccca gttcctgctc 1560 

catcttgatc aacacagact ccacactgga cttgtaagaa ccaacactaa ttaacacttt 1620 

aggaattggg cattcttcta gagaactgga aggcggcctg tcttgactgg gcttagaggg 1680 

tatgctttct tgtttctcca ccatcctgtc atcgacctcc acaaagtctt caaagagaaa 1740 

gtttgttata tgctgttgtt ttagaagtgc tctatcaatc tgactggtca ggaggtaaca 1800 

caggtctcct ctgaacatgt gttcaatgtg gctgagctga gccagggtct gggtgaaata 1860 

ttcagtgtca cagagctcct ccaaggtctt caacttcttg tcaaaacact tggtggactt 1920 

tgctttgata agttttgggg tgtaggctgg tctgcaggtg taaaggtctg tgtcggcaga 1980 

ctcatcaggg ttagaggtag tggatgcctg gctggcctga tcctggatgt gctccatctc 2040 

accaggacac aaatcatgcc ccttcaactt ccgatgaggg tatgagcgga tctgcttcag 2100 

caggtcttga tgttcaatga tggacttctc cacactggcc agtttattgg ctttctcaat 2160 

tgcctgagat gagtaaaagc ctttgtcgtt ggctttcttc tggatctccg tttctgtgtg 2220 

tagcactgtc ctggtgtaat ccaagtcttt cagctttttt tcaagttccc cagcaaatgc 2280 

cttcctgttg ccagtcttca agcactcaga agctctggaa aattcggctg aaagctcctg 2340 

gaactgctgc atgattttat gctggtctgc agagatataa gccatgcaaa acggcctcac 2400 

gaagccacgg gcctccaggt cgtatagggt aaggtggtgc acgtatgcaa aggcgccctc 2460 

cttagaatct cccagcacca ccttggagtc ctccacgaag ttcagcttgg ggtaggcaga 2520 

tccaggagga tggcccacga aggaagcctg gtaatccaca gacatgatac gcagggagaa 2580 

gtaattgaga tcaaaagtgc caaaaacttt ggtgtcattg gggatggtca gtaagggttg 2640 

gggtcccacc tgctcagaga actcggaaat aagaatgaag tccctcgaaa acttggcccc 2700 

ggacagtttt gaccaggggt tagcaccctg actggcgaag gggaagagcg gcaccgagta 2760 

ctcctcaggc agggccggct cattgtaagg ctcttcttca tactcttcct ctttggtgaa 2820 

ggccactacg tcaggggcgc tgatcat 2847 



<210> 205 

<211> 7653 

<212> DNA 

<213> Homo sapiens 



<400> 205 

agtgtgtata aggcagacct ggagtggctg cgtggcatcg gctggatgcc agaaggctca 60 

gtggaaatga acagagtgaa ggttgctcaa gacctcgtga atgaaagact ctataggaca 120 

cgtccagaag ctttgtcatt caccagcatt gtcgacactc cagaagttgt cttggcaaaa 180 

gccaattctc tgcaaataag tgagaaactg tatcaggaag cctggaataa agataaaagc 240 

aacatcacca ttccttctga tactccggag atgctgcagg cccacatcaa tgccttgcaa 300 

atcagcaata aactctacca aaaagactgg aatgacacca agcagaaagg ctatgacata 360 

agggcagatg ccattgaaat caagcacgcc aaggcctcca gagaaattgc cagtgagtac 420 

aaatacaaag aaggttaccg taagcaactg ggccaccaca tgggtttccg caccctacaa 480 

gatgacccca agtcagtatg ggctatacat gctgccaaga tccagagtga cagagaatat 540 

aagaaagctt atgagaagtc taaaggaatt cacaacacac cgttggacat gatgtcaatt 600 

gttcaagcca agaaatgcca ggtcctggtt agcgacattg attatcgcaa ttatctgcac 660 

cagtggacgt gtctgccaga tcagaacgat gtgatccagg ccaagaaagc ctacgacctg 720 

cagagcgatc cattgtatcg caatgcctgg gagaaagaga aggctaatgt gaacgtgcca 780 

gctgacactc ccctgatgct gcaatccaaa atcaatgctc tgcagatcag caataaacgc 840 

tatcagcaag cttgggaaga tgtcaagatg actggttatg acctgcgagc agatgccatt 900 

gggatccagc atgccaaggc ttccagggat attgccagtg attatctgta caaaactgct 960 

tatgagaaac agaaaggcca ttacattggc tgtcgcagcg ccaaggaaga ccctaaactg 1020 

gtttgggcag caaatgtgtt gaagatgcag aatgacaggc tgtacaaaaa ggcctacaac 1080 
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gaccacaaag ccaagatctc catccctgtg gacatggtgt ccatcagcgc tgccaaagaa 1140 

ggtcaggcac tggcaagtga tgtggactat cgccattacc tgcaccactg gtcttgcttt 1200 

cccgaccaga atgatgtgat ccaagccagg aaagcctacg acctacagag cgacacagag 1260 

ccttgctctc tggcccaggc tggagtacag tgggtggccg acatgacggc caggggtcag 1320 

agccccctcg cgccgctgtt ggagactttg gaagaccctt ctgcctccca tggagggcag 1380 

actgacgctt acctgactct gaccagtcgt atgactggag aagaaggaaa agaagtaatt 144 0 

acagaaattg agaaaaaact tcctcggctg tacaaagttt taaaggtatc cagtataatt 1500 

gattcattag aaatactgtt taacaaagga gagacgcatt ctgctgttgt tgattttgaa 1560 

gcattaaatg ttatcgtaag gctaattgaa caagccccaa ttcaaatggg agaagaggca 1620 

gtgaggtggg caaaactggt cataccttta gtggttcatt cagcacaaaa ggtacatttg 1680 

cggggagcaa ctgctctgga gatgggaatg ccattattgc ttcagaaaca gcaagaaata 1740 

gcatctatta cggagcagct tatgactact accttgcatc gaagtgggag tttcatcaat 1800 

tctctcttgc aactagaaga acttggattt cgtagtggag cacccatgat taaaaagata 1860 

gcttttattg cttggaagag tttaatagat aattttgctt taaatccaga tatactatgt 1920 

agtgcaaaaa gactcaagtt gttaatgcag cctttgagtt ccatccatgt gagaacagaa 1980 

actctagcat taacaaaact agaagtctgg tggtatttac tgatgagact tggacctcat 2040 

cttcctgcta attttgaaca ggtttgtgtg cctctgattc aaagtacaat aagcattgat 2100 

tctaatgcct cacctcaggg caattcgtgt catgtagcta catctccagg tttaaatcct 2160 

atgactcctg tacacaaagg tgcttcctcc ccgtacggag ccccgggaac tccccgaatg 2220 

aacctgagtt cgaatttagg tggaatggcc acaatcccat ccattcaact tttgggactt 2280 

gaaatgttgc ttcatttctt gttgggtcca gaagccttga gttttgctaa gcaaaataaa 2340 

cttgtgctga gcttagagcc attggaacat ccgttaatca gcagcccttc ctttttttcc 2400 

aaacatgcaa atacacttat cactgctgtt catgatagct ttgttgcagt tggaaaagat 2460 

gcccccggta acaaaaaaga gaaaccaggt tctgaagttt tgactctctt attaaagtct 2520 

ttggaaagca tagtaaagtc tgaagtattt cctgtatcaa aaacgctggg aactccagct 2580 

ttgttcttaa ttcaattaat tttcaacaat ttcttggaat gtggtgtatc agatgaaagg 2640 

ttctttctca gtttggaatc acttgtaggc tgtgttcttt ctggtccaac ttcaccacta 2700 

gctttcagtg actcagtttt aaatgttatt aatcaaaatg caaagcagtt ggaaaataag 2760 

gagcatctct ggaaaatgtg gagtgttata gtcaccccat taactgaatt gattaatcag 2820 

accaatgaag taaatcaagg tgatgcctta gaacataatt ttagtgccat ctatggtgca 2880 

ttgactttac cagtaaacca cattttttca gaacagagat ttccagtggc caccatgaag 2940 

actttgctta gaacttggtc agaattatat agagcatttg ctcgttgtgc tgctttggtg 3000 

gcaacagcag aagagaactt gtgctgtgag gaactttctt ccaagataat gtccagtttg 3 060 

gaagatgaag gcttttctaa tttgttgttc gtggatagaa ttatttatat tattactgta 3120 

atggttgatt gcattgactt ctcaccatat aatattaaat atcagcccaa agttaaatca 3180 

ccacagagac cttcagattg gtccaaaaag aagaatgagc ccctagggaa attgacttct 3240 

ttatttaaac ttattgtgaa agtgatctat tctttccaca cactgagctt caaggaagca 3300 

cattctgata ccctcttcac tattggcaac tcaatcaccg gcattatttc cagtgtactt 3360 

gggcatattt ctttgccttc tatgatccga aaaatatttg caactttaac aagacctctg 3420 

gcattatttt atgaaaactc aaagcttgat gaagttccta aagtatatag ttgtctgaac 3480 

aacaagttag aaaagctact gggagaaatt attgcttgtc tgcaattcag ctacaccgga 3540 

acttatgata gtgaacttct tgaacaactc tccccactat tatgcataat atttctgcac 3600 

aagaataaac agattcgaaa acagagtgct cagttctgga atgccacttt tgccaaagtg 3660 

atgatgttgg tttatcctga agagttaaaa ccagtactaa cacaagccaa acaaaaattt 3 720 

ctgctcctgt tgcctggttt ggaaactgtt gaaatgatgg aggaatccag tggaccatat 3780 

tctgatggac tgaaacttga atcttcgtct ttaaaagtaa agggtgaaat tcttttggaa 3840 

gaggaaaagt ctactgactt tgtgtttata cctccagaag gaaaagatgc aaaggaaaga 3 900 

atattaactg atcatcaaaa agaagttctc aaaacaaagc ggtttgagga gcaaatggac 3960 

agtgacattg tcattcctca agatgtcacg gaagactgtg gtatggctga acatcttgaa 4020 

aagtcctccc tttcgaataa tgagtgtggt tctcttgaca aaaccagtcc agaaatgtca 4080 

aacagtaata atgatgaaag aaaaaaagct ttaatttcat caaggaaaac atcaactgaa 4140 

tgtgcatcta gtacagaaaa ttctttcgtt gtcagcagta gttcagtttc taataccact 4200 

gttgctggaa ctcccccata ccctacaagt cggaggcaaa cctttattac tttggagaag 4260 

tttgatggtt cagaaaatag accttttagt ccatccccct tgaataatat ttcatcaact 4320 

gttacagtga aaaataacca ggaaaccatg attaaaacag attttctacc aaaagcaaag 4380 

caaagagaag ggactttttc aaaatctgat tctgaaaaaa tagtgaatgg aactaagaga 4440 

tcaagccgga gagctggtaa agctgaacaa acagggaata aaaggtctaa gcccttaatg 45 00 

agatctgagc cggagaaaaa tactgaggaa tctgttgaag gcattgtagt cttagaaaat 4560 

aacccacctg gtttgcttaa tcaaacagaa tgtgtgtcag ataatcaggt tcatctttct 4620 

gaatctacaa tggagcatga caatacaaag cttaaagcag caacagtgga aaatgctgta 4680 

ttattggaaa ctaatactgt agaggagaaa aatgtagaaa ttaatttgga atccaaagag 4740 

aatacacccc cagtagtaat atcagcagat caaatggtaa atgaggatag tcaggttcag 4800 

ataactccaa atcagaaaac ccttagacgg tcttcaaggc gacgttcaga agtagtagag 4860 

tctaccactg aaagccaaga taaggaaaat agtcatcaaa aaaaggaacg acgtaaggaa 4920 
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gaagaaaaac ctcttcagaa gagtccattg catataaaag atgatgtgtt acctaaacaa 4980 

aaactgattg ctgaacaaac tctacaggag aatttaattg agaaaggaag taatttacat 5040 

gagaagactc ttggggaaac tagtgctaat gcagaaactg aacaaaataa aaaaaaggca 5100 

gaccctgaga acattaagtc tgagggggat ggtacccagg acattgtaga taagtcctct 5160 

gagaaactag tcagaggccg aacacggtat caaactagaa gagcatctca gggtttgctt 5220 

tccagcattg aaaactcaga atctgatagt tcggaggcaa aagaagaagg ttctaggaag 5280 

aagagatctg gaaaatggaa aaacaaaagc aatgaaagtg ttgacattca agatcaagaa 5340 

gagaaagtgg tgaaacagga atgtataaaa gctgaaaatc agtcacatga ttataaagca 5400 

acttctgaag aagatgtaag cataaaatct ccgatttgcg aaaaacaaga tgaaagtaat 5460 

actgtaatat gtcaggattc tacagtaact tcagatttgt tgcaagttcc tgatgattta 5520 

ccaaatgtgt gtgaggaaaa aaatgaaact agcaaatatg cagaatattc ctttacaagt 5580 

ctacctgtgc cagaatcaaa tctaaggact agaaatgcca ttaagagatt acataagcga 5640 

gactcttttg ataattgtag tttgggagaa tcctcaaaaa tagggatatc agatatttct 5700 

tcgctttcag aaaaaacttt tcaaacactt gaatgccaac acaagagaag taggagggtg 5760 

aggagatcta aaggttgtga ttgctgtggg gaaaaatcac aacctcagga aaagtcactc 5820 

attgggttaa agaatacaga aaataatgac gtagagatta gtgaaacaaa aaaggcagat 5880 

gtgcaagcac ctgtaagccc atcagaaact tctcaagcta atccatattc tgaaggacaa 5940 

tttttagatg aacatcatag tgtgaatttt catttgggtc tcaaagagga taatgatact 6000 

attaatgatt cattaattgt ttctgaaacc aaatcaaaag aaaacactat gcaagaatct 6060 

cttccttctg gaatagtaaa ctttagagag gaaatttgtg atatggattc tagtgaagca 6120 
atgtctcttg aaagccagga gtcacctaat gaaaatttta aaactgttgg cccgtgttta • 6180 

ggagactcga aaaatgtttc acaggaatct ttggagacaa aagaagaaaa accagaagaa 624 0 

accccaaaaa tggaactgag tctagagaat gttactgttg aaggaaatgc atgtaaagta 6300 

acagaatcca atctagagaa agcaaaaact atggaattga atgtaggaaa tgaagctagc 6360 

tttcatggac aagagagaac caaaactggt atttctgaag aagcagcaat agaagaaaat 642 0 

aaaagaaatg atgactctga agcagacaca gctaaactga atgccaaaga agtagcaact 6480 

gaggaattta attcagatat tagtctttct gataatacta cacctgtaaa attgaatgct 6540 

caaactgaga tttctgaaca aacagcagct ggggaactag atggaggaaa tgatgtatct 6600 

gatctacact catctgaaga aacgaatacc aaaatgaaaa ataatgaaga aatgatgatc 6660 

ggcgaggcaa tggctgaaac tggccatgat ggtgaaacag agaatgaggg cataactacc 6720 

aaaacctcaa agcctgatga agctgaaaca aacatgttga ctgcagaaat ggacaacttt 6780 

gtttgtgaca cagttgaaat gagcactgaa gaaggaatca ttgacgctaa taaaactgaa 6840 

acaaatactg agtatagtaa atctgaagaa aaattagata acaatcaaat ggtaatggaa 6900 

agtgatattt tacaggaaga tcaccatact tcacagaaag tggaggaacc atcacagtgt 6960 

ctggcatctg gaacagctat ctctgagcta ataatagaag acaataatgc atctcctcaa 7020 

aaactaaggg aacttgatcc ttcacttgtg tcagcaaatg acagtcctag tggcatgcag 7080 

acacgctgtg tctggtctcc tttggcttct ccgtctacga gcattttaaa gagaggacta 7140 

aaaagatccc aagaagatga aatctcatca cctgttaata aggttcgccg tgtctccttt 7200 

gcagatccaa tataccaagc aggattggca gatgacattg atagacggtg ctctattgtt 7260 

aggtcccatt cttccaatag ttctcccata ggaaaaagtg ttaaaacttc tcctactaca 7320 

caatctaagt cagttgacct ctgtgtaaca gcttctgaaa tttattctaa gatttcagaa 7380 

atggccaaag aatccatacc atgcccaaca gaaagtgttt acccaccatt ggtgaactgt 7440 

gtggcaccag ttgacatcat tttacctcag attacatcaa acatgtgggc aagaggcctg 7500 

ggacaactca ttagagctaa gaatataaaa actattggtg atttgagtac tcttacagca 7560 

tctgaaataa aaactcttcc tatccgttct ccaaaagtgt ccaatgtaaa aaaggctctc 7620 

agaatatatc atgagcagca ggtaaaaact tag 7653 



<210> 206 

<211> 2636 

<212> DNA 

<213> Homo sapiens 



<400> 206 

gaccatgaac ttggctcccc tcccttgccc ttcctgccca tgattgtgag gcccccccca 60 

cccccagaaa cgtggaactg ttaaagtccc aatgaacctc tttccttctg taaattgccc 120 

cagtcctcaa gtatgtcttt atcagcagca tgaaaactga ctaaaagagt aaattggtac 180 

cagtagaatg gggtgttgtt gaaaagatac ccgaaaatgt gtaagcaact ttggaactgg 240 

gtaataggca gagaggttgg aacagtttgg aggactcaga agacaggaaa atgtgggaaa 300 

gtttggaact tcctggagac ttgttgaatg gctttgacca aaatgctgat gttatgtgga 360 

caataaagtc aaggctgaag tggtctcaga tgaaaatgag gaacttcctg ggaactggaa 420 

taaagacatt ggtagcattt ttcccctgcc ctagagatct gtggaacatt gaagttgaga 480 
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gaggttattt agggtatctg gtggaagaaa tttctaagca gcaaagtgtt caagaagtga 540 

cttgggtact cttaaaagaa ttcagtttta tgcattcaca aagabatggt ttggaatcag 600 

aacttatgtt taaaagggaa gcagagctta aaagttcaga aaatttgcag cc tgactata 660 

caatagaaaa gaaaaacaaa ttttctgagg agaaattcaa gctgactgaa gaaatttgca 720 

taagtaatga ggagccaaat gttaattgcc aagacaatag ggaaagtgtc ttcagggaat 780 

gttggaggtc ttcacagcat ccccttccat catgggtctg gaggcctagg aggaaaaaat 840 

ggtttcatga gctgggctca gagacttgct gcattgtgaa gtctcaggac tgcattatgt 900 

cccagtcatg gctaaaaggg gccaatgtgc agcctaggcc cttgcttcag agagtgcaag 960 

ccccaagcct tgatggcttc cacttggtgt tgtacctgca ggtgcagaga agtcaagaat 1020 

tg^gttgga caacctccac ctagatttca gaggatgtat ggaaatgcct agatgtccgg 1080 

aaatgcctag atgtccaggc agaagtttga tgcaggaatg gagccctcat gaagaacttc 1140 

tgctaggagt cccagctgtg gtactgccta gtggaacagt gggaagaggg ccaggccacc 1200 

atcctccaga tcccagaatg gtagatccat tgacagcttg cactgttaat ctggaaaagc 1260 

tgcaacggca gcccatgaaa gcagctggga ggatttttga taattcggtg tctctttcta 1320 

gaggatttca tagcaacaat atcaggtcga tacctgagaa agcatttgta ggcaaccctt 1380 

ctcttattac aatgattctc ttgatgaaag atgaatctgt tttaaagaat ttagtgcaac 1440 

aagaagctaa gaagaaagaa tctgttagaa atgctgaagc aaagattaca aaacttacag 1500 

aacagcttca agcaactgaa aaaattctta atgttaacag aatgtttttg aagaagcttc 1560. 

aggaacaaat tcacagagtt caacagcgtg ttacaattaa gaaagctttg actctaaaat 1620 

atggagaaga gcttgctcgg gcaaaggcag tggccagtaa agaaatagga aaacgtaaac 1680 

tggaacaaga tcgctttggg cagggcatgg tccgcggcgt ctgcgccggg attgccaact 1740 

attttgatgt accggtaaaa ctggtgcgta tcctggtggt gctgtcgatt ttcttcggtc 1800 

tggcgctgtt taccctggtt gcttacatca ttttgtcatt tgcgcttgat ccaatgccgg i860 

gacaacatgg cctttggtga gcagctacct tccagcagcg aattgctgga tgaagtccga 1920 

ccgtgaactg gcggcaagtg aaacgcgttt ccgcgagatg gaacgttatg tcacttccga 1980 

tactttcacg ttacgtagcc gtttccgtca actgtgagga aagttatgaa tactcgctgg 2040 

caacaggccg ggcaaaaggt aaagcctggt ttcaaattag caggcaagct ggtacttctt 2100 

accgcactgc gctatggccc ggcgggtgtg gcgggctggg cgataaaatc agttgctcgc 2160 

cgaccgctga aaatgttgct ggctgtggcg ctggaaccgc tgttaagtcg ggctgctaat 2220 

aaactggcac agcgttataa aaggtgaggg gagtttcgca aaaattgtta aatctcaggc 22 80 

gtataatgga tggcaatttt catccataga aggacgctta catgtttaaa aaaggcttac 2340 

ttgctctggc actggtgttt tcaatgcccg ttttcgccgc tgaacactgg atcgatgttc 24 00 

gtgttccaga gcagtatcag caagagcacg ttcagggggc catcaatatt cccctgaagg 2460 

aagtgaaaga gcgcattgcc accgccgttc cggataaaaa cgacaccgtg aaagtgtatt 2520 

gcaatgccgg acgccagtca gggcaagcaa aagagatcct tagcgagatg ggatataccc 2580 

acgttgagaa cgccggtggc ctgaaagaca tcgcaatgcc gaaggtcaaa ggttaa 2636 



<210> 207 
<211> 547 
<212> DNA 

<213> Homo sapiens 



<400> 207 

gtcttctcct tcctgccacc ttgtgaagaa gataaaaata aaaatgaagt caccagccct 60 

caggggtctg tcaaggcaac acacaaaatc tccagttacc ttctggtgga tgacgtttgg 120 

agacacatcg cgtccgtcac aggacacact tccgatggat ttacagcaac tcctgggggt 180 

cacgaaggtg tgttccaagg ccacatctcc aacctcacag agaggacagg aagtgatcag 240 

taccccaact agcaagagtg gcccgttcat tggccgaggt tcttgaggat agagtgggag 300 

gtgggagcgt ccctcgtgct gccttcattt ttcctaccct cagctcaggg gtctgtgcta 360 

gaggaagaca cacgaaggct actgtgcccc caccacaaga gagggaaaca gggagacgcg 420 

tgggtctgga gccaagcgta tctctctgta cccatcgcac acgaatttga atcatctgct 480 

ctttggaatc gcctacaccc tgctgcaccc acgatgtggc tgtaactcca ttaaagaatt 540 

gaatacc 547 



<210> 208 

<211> 3035 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 
<222> (1) . . . (3035) 
<223> n - a,t,c or g 



<400> 208 . 

ttttttcaaa ggggaagagt agcatgtttg tacctaggat ggaccaagag atgtttgccc 60 
agaggagggg gttgagcaaa gcgacaggtt aatagtaaaa caattaagat gacaaggaaa 120 
attactgggc ttaagatttc taactacatt tacttccttg acgaacgctt cggccgtgcg 180 
tagactggtc agcttccggg gtgacttaga gcagggctgt ggtcactttc actggcatct 240 
gggtcctgtc cgtaggatca gccggtttgg atggtctggg tcttgctggc atggtccact 300 
tgtcctgggc tgctgggttt caagccgact gtggagatcg aaggcacagt tccgtggtcg 360 
cccgcgggaa cttcaagggt ccttcctacc cgcgttgctg agagtctggg tttacgcgtc 420 
acctcgggcg ggacccgatc ctccgctcct gaggccccca caatgaagca gtcggacgcg 480 
tctccccaag aaaggtaccc gccagcctct catttttaaa ggcagagatg tgcgtctcgc 540 
catgttgccg cgtctggtct cgagctcccg ggctcaagcg atcttcccgc ctcgacctcc 600 
caaggctctg ggagtacaga cgtgagccac agcggccggc ccgtgtttaa ctcagataaa 660 
atctgggtaa cacactttca acgcttcaac cccctcgggc gcaccgctct gcgctcattc 720 
caactttgaa agcaggaagg aagaaatgcg caggctccag ccgcgtcccc gcaggcccca 780 
ctcccgtttc ctagcaacgc cgggtcacgt gcgccgccgc ccggattccg tagcgtgagc 840 
ctgccggagc cggcgcgtac atgcgagcgt gtgcgcgcgt gcgcaggcgg ggcgaccggc 900 
gtccccggcg ctcgccccgc ccccgagatc ccgccgtgcg tgcgcgcgcc cggtccgcgc 960 

ctccgccgct ttttatagcg gccgcgggcg gcggcggcag cggttggagg ttgtacgcgg 1020 

attccggcga ggaatatgga atcatggcgt ttgcgctgtt cgtgctgctg ggatctcgcg 1080 

ctgctgggca cccacggagc catccggggc tgccggcaca gtcttcacta ccgtagaaga 1140 

cctttggctc caagatactc ctcacctgca tccttgaatg acagcgccac agaggtcaca 12 00 

gggcaccgcg tggcatgaag gggggcgtgg tgctgaaaga aagatgcgct gcccggccag 1260 

aaaaccggag tttcaaggtg gactccgacg accatgtggg gatgaagtac tcctggcgtc 1320 

ttcctccccc gagccccatg ggcacgggcc aaccatccaa gcttccacgg ggcctcccag 13 80 

agttgaaggc ttgtgaagtt cgttcagaac acattcaacg agggggaaga cgggccttgt 1440 

tgggtcttgc aagttcagag ttcgtgccac cttgtcactg actggggcct tggtacaaga 1500 

tcactgactc tgagccacaa ggcccttcat tgaacgctca gagaacaagg ttctttcgta 1560 

ggtccctccg caggcccggt caaaagctac caacattgag aacctgaaca catagggccg 1620 

cccccggcca ggaaccggtg caacggcacc agctccaaag ggctccgacc gaggccatta 1680 

caattcctcc gcgtgcgcac agccacctga gcagccctct tggcccttcc tgggcatcgt 1740 

gggttgaggt gctggtgctt ggtcaccatt catcttcatc tacgagaagc gccggaagca 1800 

ccgaggacgt cctgggatga tgacgacgcc ggctctgcac ccctgaaaga gcagcggggc 1860 

agcaccagaa ttgacaaagg caagaaacgt tccgccagag ggaatttttc ctgagggcag 1920 

gttggcccga ggacgctcgc ctggttccac tgtctgcgcc gccgccggag ttccactccc 1980 

agttgcttgg aagatttcca aggttctcac ctctttaaag aaaacccact cccgtaggaa 2040 

tcccattcat tacacttcct tcttttttaa aaaaagttgg ggttttcttc catttcaggg 2100 

attcttgttt cctttaggtt tttttttcct ttctggaagt gtttacgaga agccccggga 2160 

gcttgcttgc ccttgcgggc cccgtctttt ggtttttaag ccctctgggg tctgaagtca 2220 

ttggccgggt tggggcgcaa cagccttctt ccactgggcc gggagtcagt gccagggtcc 2280 

ttgccccttt gtgggaaagt tcacaggtca acacgaaggg gcccccgtgt ccctgccttg 2340 

tctgaaagcc aaatgctgtc tgggttgcgc ccatttttgt tgctttttat gttttaattt 2400 

ttatgagggc ccacggggtc tgttgttcga ctcagccctc aggggacgaa ttctgaacct 2460 

cttgggccac agaggaattc aatttgccca cacccgaggg cgaaccccgt cacagcctca 2520 

aagtcactcc caagccccct ccttgttctt gtgcatccgg gggcagctct ggagggggtt 2580 

tgcttgggga actggcgcca tctgccggga ctccagaacc cgcagaagcc tccccagctc 2640 

acccctggag gacgggccgg gttctctata gcaccagggg ctcacggtgg gaaccccccc 2700 

tcccacccac cgccacaata aagatcgccc ccacctccac cctcnnnnnn nnnnnnnnnn 2760 

ggggggcccc ggggccggaa atcccaaatt ttaaaagggg aaatccggaa aacccccggg 2820 

gggttcttta aaggggggtg ggggccccat caaaattttt ttatttaaaa cccggttggc 2880 

ccttttttta ggaaaaaaac ccccctaaag ttaaaatttg ggccgggaac ccctgggctt 2940 

aaaaaatttt ttaaaggccc aaaaataagg ccccattttt ttttgggccc aaattttttt 3000 

aagaaaaaat tttaagaggg ctttattaaa aactt 3035 



<210> 209 
<211> 939 
<212> DNA 
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<2l3> Homo sapiens 



<400> 209 

atgaaggtct tattagagtc agttaaagaa agagcggaag aagagaaatt ggcagcagca 60 

catcttagga gttttgctgc aaaaaaagca aagaaatatg atagtgtcaa aaaggaaaaa 120 

actcttcagg atgtggactt aactcagcat cagcacaaac agacccgagc tctgtctgga 180 

ggcctgaaga ggaagctctc ccttggcatt gctttcatgg gcatgtcgag gaccgtggtt 240 

ctggatgagc ccaccagtgg ggtggaccct tgctcccggc atagcctgtg ggacattctg 300 

ctcaagtacc gagaaggtcg tacgatcatc ttcacaaccc accacctgga tgaagctgaa 360 

gcgctgagtg accgcgtggc cgtcctccag catgggaggc tcaggtgctg cggtcctccc 420 

ttctgcctga aggaggcata tggccagggg ctccgcctga cactcacgag gcagccttct 48 0 

gttctggagg cccatgatct gaaagacatg gcttgtgtta catccctgat aaagatctat 540 

attccacaag catttctcaa agacagcagt ggaagtgagc tgacctacac cattccaaag 600 

gacacagaca aggcctgctt gaaagggctc ttccaggccc tggatgagaa cctgcatcag 660 

ctgcacctga cgggctatgg gatctcagac accaccttag aagaggcaga agggaggacc 720 

gcagcacctg agcctcccat gctggaggac ggccacgcgg tcactcagcg cttcagcttc 780 

atccaggtgg tgggttgtga agatgatcgt acgacctggg tccaagcaca gggtgcatca 84 0 

gcaccaggag gacaaaggcc ccaggaggac cttccctcct ttcctcagga tggtagatcc 900 

agggcacaat ttaaagatcc tcaccaattt tcaaactga 93 9 



<210> 210 
<211> 1746 
<212> DNA 

<213> Homo sapiens 



<400> 210 

atggcttctc atgcttatga taaaaatcaa aatgccaatg tcttagtcca cttgtgcttc 60 

tataacagaa tacccaagac tggagcatat tacctagatt ctaggagtgt gagcatcagt 120 

tatttgattg gccatcatat agacatgggt cttgagacag ccacttccaa aaatgaattc 180 

attttcgata gtgcaagtac actgctgggg atgcttttta ggaaaccctc ccaacactct 240 

ctctcccttt ttagcaagaa attccaggag aatctgatat acctggaatc tgatgactgc 300 

ttgccacctc ctccccctcc accttggtct gagccaccgt catttctcac ctggactatc 360 

gtcacagttt tccagtgggt ctccctgctt ctgtccttgc ccaacataca ggtcattctt 42 0 

tacagagcag ttggagtagt accttcacaa cctaagtcag acaacctgaa aggatgggga 480 

tcggggagag tcgttaaaga gaagctcaga tcagagatac ctgactggaa gattaaaagt 540 

attcacatac tggagaggac tgcttcctct agcaccgagc cctctgtaag tcggcaattg 600 

ctagaaccgg agccagtccc cctctccaag gaagctgaca gctgggaaat tatagaaggg 660 

ctgaaaatag gccaaaccaa tgtccagaaa ccagacaaac atgagggctt tatgctgaag 720 

aaaagaaaat ggcctttaaa aggctggcac aagattcaga aaggaaaggt ccatgggagc 780 

atagatgtgg gactctcagt catgtcaatt aaaaagaaag ctcgaagaat agaccttgac 840 

accgaagagc acatctatca tttgaaggtg aaatcagttt tcaacagttt ctctgccatt 900 

atcaggggaa acgatttgcc tacaccagtg gtgaaatccc aggactggtt tgatgcatgg 960 

gtctccaaac tgcgacatca tcggttgtat cgtcagaatg aaattgtgag atcaccaaga 1020 

gatgctagtt ttcacatatt tccttcaacg tccacagctg aatcctcacc agctgctaat 1080 

gtttctgtaa tggatggaaa gatgcaacca aacagctttc cgtggcagtc ccctttacca 1140 

tgcagcaata gcctccctgc aacgtgcaca actggccaga gtaaagtggc agcctggtta 1200 

caggactcgg aagagatgga caggtgtgca gaagatcttg cacattgcca gtcaaacctt 1260 

gtggaactta gcaaactcct gcaaaatttg gaaatacttc agagaactca gtcggcacct 1320 

aactttactg acatgcaggc taactgtgta gatatttcaa agaaagacaa gcgggtcaca 1380 

agacgatgga gaacaaaaag tgtcagcaaa gatacaaaaa tacaactgca ggttcctttc 1440 

agtgctacca tgtcaccagt tcgcttgcat tcctccaacc ccaacctttg tgcagatatt 1500 

gaatttcaga ctccccctag ccacctcact gaccctctgg aaagttcaac agattataca 1560 

aagctgcaag aagaattttg tctaatcgca cagaaaggaa agggggcatc aaaaaagcaa 1620 

gccaaaagga atgctgctga gaaatttctt gccaaattta gtaatatttc tccagagaac 1680 

cacatttctt tagtgagtaa tgttgacagc tatgatgtta atgtcattaa gcatttcttg 1740 

cagtag 1746 



<210> 211 
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<211> 1251 

<212> DNA 

<213> Homo sapiens 



<400> 211 

atgctcagaa cgcgtaaagc accccacagc tgggtaaaat catctagcaa cacagtgcac 60 

tatagagtat cagttgtttg ccttcatgat catgtaactg actgggagtg gcagctcact 120 

gccactgccc gacatccaaa gagagtatca cactacatat tgtgggacca ggagaagacc 180 

aaaattaaaa ttcgaaaaga tataatccgc atcctcccca gtctagatgt cgaagtcaaa 240 

gacatcactg attcttacga tgctaactgg tttcttcagc tgttatcaac agaagatctt 3 00 

tttgaaatga ctagtaaaga gttccccata gtgactgaag tcatagaagc acctgaagga 360 

aaccacctgc cccaaagcat tttacagcct gggaaaacca ttgtgatcca caaaaagtac 420 

caggcatcaa gaatcttagc ttcagaaatt agaagcaatt ttcctaaaag acacttcttg 480 

atccccacta gctataaagg caagttcaag cggcgaccga gggagttccc aacggcctat 540 

gacctagaga tcgctaagag tgaaaaggag cctcttcacg tggtggccac caaagcgttt 600 

cattcccctc atgacaagct gtcatccgta tctgttgggg accagtttct ggtgcatcag 660 

tcagagacga ctgaagtcct ctgtgaggga ataaaaaaag tggtgaatgt tctggcctgt 720 

gaaaaaatcc tcaaaaagtc ctatgaggct gcgctgctcc ctttgtacat ggaaggaggt 780 

tttgtagagg tgattcatga taagaaacag tacccgattt ctgagctctg taaacagttc 840 

cgtttgccct tcaatgtgaa ggtgtctgtc agggatcttt ccattgaaga ggacgtgttg 900 

gctgccacac caggactgca gttggaggag gacattacag actcttacct actcataagt 960 

gactttgcca accccacgga gtgctgggaa attcctgtgg gccgcttgaa tatgactgtt 1020 

cagttagtta gtaatttctc tagggatgca gaaccatttc tagtcaggac tctggtagaa 1080 

gagatcactg aagagcaata ttacatgatg cggagatatg aaagctcagc ctcacatccc 1140 

ccacctcgcc ctccgaaaca cccctcagta gaggaaacaa agttaaccct gctaacctta 1200 

gcagaagaaa ggacggtaga cctgcccaag tctcccaaga gaagaagata a 1251 



<210> 212 
<211> 909 
<212> DNA 
<213> Homo sapiens 



<400> 212 

atgtttgcgt ttgaacctct tggcgggtgc cggccatggc ggctctcgct cccgggactt 60 

Srggagccggc ttttccgcac atatggggct gcggacggca ggagacagcg gcggccgggc 120 

cgggaagccg cgcagtggtt cccgccgcag gaccggaggc gtttcttcaa cagcagcggc 180 

agcagcgacg . ccaggatggg cgacccctcg cagtccgacg atcctgacga tcccgacgac 240 

cccgacttcc ccggcagccc ggtgaggcgg cggcggaggt gtcccggcgg ccgagtgccc 300 

aaggaccggc ccagcctgac cgtgacccca aagcgctgga agctgcgagc tcgcccaagc 360 

ctaaccgtga ccccaagacg cctggggctg cgagctcggc ccccgcagaa gtgcagcaca 420 

ccctgtggcc cgctccgact tccgcccttc cccagccgcg actccggccg cctcagcccg 480 

gacctcagcg tgtgcggcca gcccagggac ggcgacgagc tgggcatcag tgcctccctg 540 

ttcagctctc tggcctcgcc ctgccccggg tccccaacgc caagggacag tgtcatctcg 600 

atcggcacct ccgcctgtct ggttgcagcc tcagccgtcc cgagcggcct ccacctccca 660 

gaagtctccc tggaccgagc atctctcccc tgctcccagg aggaagcgac aggaggagcc 720 

aaggacacca ggatgggcag tgtccgagtc ctgcgtgacc ctgtgggggt caacctctat 780 

gagcattcgg tcagcaaatg ccacgtcggc cagccggaca ctgatcccag agagaaggtg 840 

aaggctgctc cggaagagct gtgtctgcac gcgctccagc acccccgctc tgagcaggct 900 

gactgctga 909 



<210> 213 

<211> 1188 

<212> DNA 

<213> Homo sapiens 



<400> 213 

tcagttcttt tgcttgttaa cgtgtctgct caccgcctcc cgctccccaa agccaggact 60 
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gtgcagttga tcccaggagc gagcccggcg cggctccctg ccggcagcga agcgaaacat 120 

ttggacatgg ccccgatttt ccgggtgcgg ttggcaagtc ccggcgtgcg cgaaccaaac 180 

ctgcggttcc cggggcagct ttgagcgggt cgggtcggct ccataggaga atggccccac 240 

tgtttccaga aggatccaag agaggacttt cagctctgga attttgaact caggatctgt 3 00 

ttctccctcg cggaaagagg aaggtggacg aagggcaagc ccaggccggc aggggctacc 360 

gcaggaagac gcccattcct ggactcgggt tcgcaggagc ccccttgctc cccagagccg 420 

gaactgcgct gcctgccacg ctcggctgac tccaaggaag tccagggcaa cgtgattgtc 480 

catggccact tgtgttggtg cattccgaac aaacagtctg ctgcctggct gcacaggtct 540 

gaccagtgtg gtcatgggtg ggtgacgatg taacaaggtg cattctggct gctgttttcc 600 

agccccagat ccttcttcct cctctcggtc ccaggtggct tccagagtca tcccgctggc 660 

tcctcctgca tggcaagtcc cagttagctg tacagaatct gcagaaggtg gctcatcggg 72 0 

gagattggcc tggatctggt catccagctc cacagtccca gcactcttct ctgagaagaa 780 

gcgcagcacg aagtcgccct ccttgttggg ctcgaaggtg gagggcacca ccacatactc 840 

cccgggtggc aggcggaagc gggtgctgac ctctcgcagg ttgatgaact gctctgagcg 900 

cgcccgagac gcattggcca ggaagaagtc acgcttcaag tgtacggccg gctggcccac 960 

cagctcgtag accgcgaagc caatagtctc catgtcgcgg ccgaagcggc gctcgcggcg 1020 

acggtgcttc tgcataaggg cgagcacgaa gctgcagcct gactcgcggt ccccgtagtc 1080 

gtccgggtca tccgtctcat ccagccggat cttgaactga gggttcaccc agaaggtggc 1140 

ttctttctta tgatgtaatt tactttgaat aattaaggct ccctccat 1188 



<210> 214 
<211> 749 
<212> DNA 
<213> Homo sapiens 



<400> 214 

ggggagaatc gatgacatta ggccactgga ggccattagc gagtgagctg agctgcgcgg 60 

cgcggcgcgg cgcggcacgg cacggtggga gtgtctccgg ctggcctgca gggagaacac 120 

cgactgagac ctctaaccct ggctccagtg gcatgtgata accagtcgtt ctgctcagtg 18 0 

gtgggggtgg tatttgccga gccagaatgt gtgtgctgtt ttaggaatcc gtcacctttg 24 0 

aggatgtggc cgtggagttc atccaggagt gggcattgct ggacagcgca cggaggagcc 300 

tgtgcaaata caggatgctt gaccagtgca ggaccctggc ctccagggga actccaccat 360 

gcaaacccag ttgtgtctcc cagctggggc aaagagcaga gccaaaggca acagaacgag 420 

ggattctccg tgccacatgt gttgcctggg aatctcaact taaacccgaa gagttgcctt 48 0 

ctatgcagga tcttttggaa gaagcatcct ccagggacat gcaaatgggg ccggggctgt 540 

tcctgaggat gcagctggtg ccctccatag aagagaggga gacaccattg actcgagagg 600 

accgaccagc tctccaggac ccgccttggt ctctgggatg cacgggactg aaggccgcta 660 

tgcagattca gagggtggtg ataccagtgc ctactctggg ccaccgcaac ccatgggtgg 720 

ccagggattc tggtgccatt ggaaatggg 749 



<210> 215 
<211> 723 
<212> DNA 
<213> Homo sapiens 



<400> 215 

cccctgcctc ttgatcaaag gctcctggcc agcattacac cctcaccttc aggccaatcc 60 

atcatccgaa cccaacctgg agcaggggtc cacccaaagg ctgatggtgc tctgaaggga 120 

gaagctgagc aatctgcagg gcaccctagt gagcacctct tcatctgtga ggaatgtggg 180 

cgctgcaagt gcgtcccctg cacagcagct cgccctctcc cctcctgctg gctgtgcaac 240 

cagcgctgcc tttgctctgc tgagagcctc ctcgattatg gcacttgtct ctgctgtgtc 300 

aagggcctct tctaccactg ctccactgat gatgaagaca actgtgctga tgagccctgc 360 

tcttgtgggc ctagttcttg ctttgtccgc tgggcagcca tgagcctcat ctccctcttc 420 

ctaccctgcc tgtgctgcta cctgcctacc cgtggatgcc tccatctgtg ccaacagggc 480 

tatgatagcc tccggcgacc aggctgccgc tgcaagaggc acaccaacac tgtgtgcaga 540 

aagatctctt ctggtagtgc acccttcccc aaggcccagg aaaagtctgt atgaccttcc 600 

aacaaggtgg atccagagct tttctccttc tagtccccaa cagcaaagca taggcctcat 660 

ctttggagag ggggaggagt gataaactag ccaaagttag ggcctctctt ttgttcctgc 720 
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agg 723 



<210> 216 

<211> 1572 

<212> DNA 

<213> Homo sapiens 



<400> 216 

atgaagacgt gccttgcttt tcccttggcc ttccaccatg attcaaacgg atcggcggtc 60 

ctaagacttc aagcaaccca cggaatttct agtgacagcc ctctcccatc tgtagaaaag 120 

gcgagccatt ctccggaccc cagcgaatac tttaggaagc acccacctgt tcgcagatca 180 

gggctccgga ctaagcgcac gagcccaggc ccgggagccc gcgtacctgg cagtcagtct 240 

ttccgcagcg ccgaggcttg cggtgtggcc gccctggaat gttggcgccg tcgtgtccct 300 

gtcccccttt cgtcgccagc tgaggtgcag gtcttgctca agaaggcgct tcgacctgaa 360 

agcaggcctt tcagaaatca gatcttacac aattgtgaga gaaactgggg gaataaaggt 420 

tggaaagggc tcgttggaag atcggagtcg caaaccggac aatctgagaa gctgagcatg 480 

tccagccacc acagagggac tgtgcgggag gagctggtgg tagaggaata tatagggggt 540 

tggtgcctct gtgggtctgc atggaagctt ctggtgacag ggctggagca gctgctcttc 600 

agcaggaccc gccctcagga ggaggctgtg gacaaaacct ggaggacagc caggcagctg 660 

gagtcaggga cacttttgtg tcgtcattgc atcacattgc cctggccttc tgagaggaat 720 

ggtggttgtt tcctttctcc ctccaacatg cttgtctgtg agttaagggt tctttctgtc 780 

atcgttgcat ccccagagcc tagtacagaa catacacagg aacacttatc tggagatgaa 840 

tttgagaaat cccaaccaag tcgaaaagag aaaagtttag gattattgtg tcataagttc 900 

ttagcacgat atcctaatta tcccaaccct gctgtgaata atgacatctg ccttgacgaa 960 

gtggcagagg aacttaatgt tgaacgtcga cgcatttacg atatcgtgaa cgtcctagag 1020 

agtttacata tggtgagccg cctcgccaaa aacaggtaca cttggcacgg gcgacacaat 1080 

ctcaacaaaa cccttggcac cttgaagagc atcggggagg agaataagta cgccgagcag 1140 

attatgatga tcaaaaagaa agaatatgag caagagtttg actttattaa gagttacact 1200 

tctgtaaaca gccgcaaaga caagtcttta agggtaatga gccagaaatt tgtgatgctg 1260 

tttttggtgt caacgcctca gatagtaagc ctagaagttg ctgccaagat tttaattggg 1320 

gaggaccatg tggaagattt ggataaaagc aagtttaaaa ctggctccct ggttcggctc 1380 

tttgcccctt gcctctctgg tgctggcagc aagcttccag gcctggtaga ggcactggcc 1440 

tttgaggtat cattggctgc cttctcagtg gtggcatctg tggagtcttt tgagccagta 1500 

gctttagaag agtgggtggt gcacaccgtt gggctcaggc cttggggtgg cgtcacactt 1560 

tggtgggctt ga 1572 



<210> 217 

<211> 719 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> (1) . . . (719) 
<223> n = a,t,c or g 



<400> 217 

agcccgggtt tgcattaaat tttttaaaag gccatagtct cctcaactct attgaattgg 60 

ggttaataag actccacaaa attgtcaagc agaccaatta actagggaca aatttcaaaa 120 

cctaaaaatt tagtttgcgt gtaacgtggg actgcacaca agtttggaaa tcatggcaaa 180 

acatcaatta cctaggacta caaaacatta atttcttctt ttaagcagag agaatatgga 240 

tttaattgtt taccataaga aaagtgacat atccaaccaa ccatccatcc ccacctgtgc 300 

cctattcttt ccttgtgttt ctttagagcc ttttcagcta tttcctgtga agcaaactgc 360 

acgaaggcct cccccgtact cctcccctgg gaagtccacc ggcaatgtta tcccatttgg 420 

gcacggattt ccaacccttc aacccaaggg ncaaataacc ccagtagggg ggcaatatta 480 

acatcacaag cccagaaatg attcttctta taggctttaa ataaaccagg actttttaac 540 

tttagggtga atgggtatgc tttcaacaag tactctttaa atatgcattg caataatatg 600 

a'agttccatt attataaagt ataactattc ttaacacacc catggtacag tatatattaa 660 
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aaaaaacttg ctgaacacag gatgcattaa gaattcaaca acttaagaac acatatctt 719 



<210> 218 

<211> 1755 

<212> DNA 

<213> Homo sapiens 



<400> 218 

atgaggcaaa ccaagacaga atacatccaa gagtttaacc aggaggccac agttgcaagg 60 

gctttggaag gtcaggaaaa gcccaccgaa gggccaagaa acacctgcct ggggagcaac 120 

aacatgtacg acatcttcaa cttgaatgac aaggctttgt gcttcaccaa gtgcaggcag 180 

tcgggcagcg actcctgcaa tgtggaaaac ttgcagagat actggctaaa ctacgaggcc 240 

catctgatga aggaaggttt gacgcagaag gtgaacacgc ctttcctgaa ggctttggtc 300 

cagaacctca gcaccaacac tgcagaagac ttctatttct ctctggagcc ctctcaggtt 360 

ccgaggcagg tgatgaagga cgaggacaag ccccctgaca gagtgcgact tcccaagagc 420 

ctttttcgat ccctgccagg caacaggtct gtggtccgct tggccgtcac cattctggac 480 

attggtccag ggactctctt caagggcccc cggctcggcc tgggagatgg cagcggcgtg 540 

ttgaacaatc gcctggtggg tttgagtgtg ggacaaatgc atgtcaccaa gctggctgag 600 

cctctggaga tcgtcttctc tcaccagcga ccgcccccta acatgaccct cacctgtgta 660 

ttctgggatg tgactaaagc gcttgtgggfc ggttatgaca tccttgccat ctatgaggtg 72 0 

gagcatttcc aacaagagca gtgtgtggcc gtgatctccg tgtgctcccg cgggggaaag 780 

ggcagtgctg agtgtgggca ctggggcaag ggcctgacca ccgagcactc ctcacctgtg 840 

cctgcagtca cccacctttc cttggcaagg atagcggaaa aaggcaaagc cggatgccct 900 

gcacgggcct gccgggtgca ctcctgggtc cttgtgctat cggggaagcg cgaggtgcct 960 

gagaacttct tcatcgaccc attcacagga catagctaca gcacccagga tgagcacttc 1020 

ctgggcatcg aaagcctgtg gaaccacaag aactactgga tcaacatgca ggattgctgg 108 0 

aactgctgca aggtgcctag ggagggggag ctgggtgacc ctgaggaacg gctcgctcat 1140 

ttgtttggtg gcttggactg tctgtccatc tgcatggcgg acgtagccag tctgctgctc 1200 

ctgaactccc gtctcaccat cctgtcccag ctgccctcat ggtgtatgag cctcttcctg 1260 

caacaacagg acttgatctt tgacctgggt gaccctgtga gatgggagta catgctcctg 132 0 

gggactgata agtctcagct gtccttgact gaagaagacg acagtgggat aaacgatgag 1380 

gatgatgtgg aaaatctggg caaggaggat gaggataaga gcttcgacat gccccactcg 1440 

tgggtggagc agattgagat ctccccggaa ggtaccaata ttttggagat aaaggagtgg 1500 

taccagaacc gggaagacat gctggagctg aaacacataa acaagaccac agacctgaag 1560 

acagactact tcaagcctgg ccacccccag gctctgcgcg tgcactcgta caagtccatg 1620 

caacctgaga tggaccgtgt cattgagttt tatgaaacgg cccgtgtgga tggcctgatg 1680 

aagcgggagg agacacccag gacaatgaca gagtactatc aaggacgccc agacttcctc 1740 

tcctaccgcc atgcc 1755 



<210> 219 

<211> 1437 

<212> DNA 

<213> Homo sapiens 



<400> 219 

atggagatca gaaggtccac cctttcagca ccccctctcc agggacaccg gccttccaca 60 

ttcccacaac cttctcccct gctgcaggac ctgggcatca cttacctatg gaccctggag 120 

agggcttggc agaaggacct gggttacctg cagcagtggc tgaaggcctt tgtaggtgcc 180 

ttcaagaaga gcatctcact gtcctctctg gagccacgaa ggccagagga ggcaggtgca 240 

gaggtcccgc tgctaccact ggatgagctg catgtgctgg ccgaacagct gcaccaggct 300 

gacctggagc aagccctcct gctgctcaag ctcttcatca ttctctgcag gaacctggag 360 

aacatagagg caggccgggg ccaagtgcta gtgccccgag tgctggcact gttgaccaag 42 0 

ttggtggcgg agctgaaagg atgcccacca ccccagggcc gaggcacgca gttggagaat 480 

gtggccctac atgctctgct tctctgcgag ggcctctttg acccttacca aacctggcgg 540 

cgccagcgca gtggggaagt catcagctcc aaggagaaga gcaaatacaa gttccctcct 600 

gctgctttgc cccaggaatt cagcgccttc ttccaagacc ccagacacag gaaccaggat 660 

tctaaaccca gaccccagat ccaggactca agccccagga ctctaaaccc agacttccag 72 0 

gactgcaaac ccaaacccca gacccaggac ccaggacaag ggctgacgga tttccagatc 780 
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agcgtggccc tgtgtggaca aggccagtgg gaaggggcac cggggggtca cactcttggc 840 

aacagcgggg gtagctgctt ggcagacctc cctggtagcg ttcagaaggg catgaagaat 900 

gtccatcatc ctgtcagagc caatttgggg agctcagact atggttgcca cgtggagggt 960 

gggcagaagg agtctccgca ggcagccatg gcccgccctg gcctgccggt cccgcccctc 1020 

ctggcccaca gcctctgggc tgggacgcag gcccacactt cccggaggag tcgagaacac 1080 

ggcctgggcc agacccagag ccttagaagg accacgggac ggtgctctaa ctcacccaaa 1140 

gaaagcacag agatgggcac aaagaaccca ggaccaaaat acaacttcct gtccccggag 1200 

ctgagggcca gacttgcccc ccagagccct gggctggggc ccacaggagg aacgcaaact 1260 

ggcaggccca aggctcgagg agagaaaggc cttcaggttt tcagactccc aggagccact 1320 

gggtacccag tgctggcaga tcaggctttc tfcgggcctca gttttcctgc ctgtgacaca 1380 

ggcaagaacc acctccagcc cccctctgat ttctctgggt gccttaaact tgagtga 1437 



<210> 220 

<211> 357 

<212> DNA 

<213> Homo sapiens 



<400> 220 
gtgcgattca tctttctgtc tctaggtctt 
gagtgccttc agtgaaggaa taaatgacca 
aaccctgagc atcatcgggt gcgcccacca 
gaaatccagt tctgttttgt ttctgaaata 
ggctttggfcg gcctacactg tgctttcgga 
agctactgat gaaggacaaa aagaattttc 



<210> 221 

<211> 339 

<212> DNA 

<213> Homo sapiens 



<400> 221 
gtgcgattca tctttctgtc tctagttctt 
gagtgccttc agtgaaggaa taaatgacca 
aaccctgagc atcatcgggt gcgcccacca 
gaaatccagt tctgttttgt ttctgaaata 
ggctttggtg gcctacactg tgctttcgga 
agctactgat gaaggacaaa aagaattttc 



<210> 222 

<211> 2485 

<212> DNA 

<213> Homo sapiens 



<400> 222 

ggggaagtac tacaagctgt cctcaggtac tgcccctacg tgtgtttccc tgggatgggg 60 

ccttgctaga ggtgattctg ctgccccagc attgggaagc aggaccagtg catgtgcccc 120 

atgcagccac ggtacatgga agctgtcact tgagccatca gaccggctca gcccatgtga 180 

ccggagctct gaagaggcac acacgcatgc cccgcacagg ctgttagcac ttgtcgcctc 240 

gctgccctgg tccagactac ccttgcttgc cccacagagc cactcagagg ctgaagccac 300 

ttcacagccc actggtgttg agaatcatca tcagaaaacc agatatgtca aggcaggtgg 360 

tccagtcatc tgccggagtc tgcctgagtc cagggggttt ttatgggctt cagaggggag 420 

gaagtgcatg ctgattggtt cgtgggcagc catgggcagg ctcagaaaaa gcaccataag 480 

ttctcgtttt gggccccaga ctctagctgg aactggcaga cctcaggcca tccctgtctt 540 

gaagaaacac agtgatgcag tcctgcttgg ggtgtgcttc ctgaagctac tgcaccagca 600 

tcaccaggaa cttggtgaaa atgcagattc tcagacccta ccccagaccc actgggagtt 660 



agcatgtgca cggcacagag caggcgctca 60 

taccaccagg cacagagtga ggcctttaga 120 

cttctgtatg tgactcaaag taagtgtaag 180 

ctagatgtca ggaagggtgt ttaggtgtca 240 

attattcact gaggcgaaag aacagagatt 300 

tgctgaatcc tgaatttgcc aggatga 357 



agcatgtgca cggcacagag caggcgctca 60 

taccaccagg cacagagtga ggcctttaga 120 

cttctgtatg tgactcaaag taagtgtaag 180 

ctagatgtca ggaagggtgt ttaggtgtca 240 

attattcact gaggcgaaag aacagagatt 300 

tgctgaatc 339 
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tatattgtca gaagattata acaagatgac tcctgtgaaa aactatcaag tgttggaagt 720 

tct 9Srccagg gcaatgaggc aggagaagca aataaagagt attcaattag gaaaagagga 780 

agtcaaattg tctgtgtttg cggatgacat gattgtgtat ctagaaaacc ccattgtctc 840 

agcccaaaat ctccttaagc tgataagcaa cttcagcaaa gtctcaggat acaaaataaa 900 

tgtgcaaaaa tcacaagcat tcctatacac caataacaga caaacagaga gccaaatcat 960 

aagtgaactc ccattcacaa ttccttcaaa gagaataaaa tacctaggaa tccaacttac 1020 

aagggatgtg aaggacctct tcaaggagaa ctacaaacca ctgctcaacg aaataaaaga 1080 

ggacacaaac aaatggaaaa acattccatg ctcatgggta ggaagaatca atatcatgaa 1140 

aatggccata ctgcccaggg taatttatat attcaatgcc atctccatca agctaccaat 1200 

gactttcttc acagaattgg aaaaaactac tttaaagttc atatggaacc aaaaaagagc 1260 

ccgcattgcc aagacaatcc taagccaaaa gaacaaagct ggaggcatca cgctacctga 1320 

cttcaaacta tactacaagg ctacagtaac caaaacagca tggtactggt accaaaacag 1380 

aggtgtagat caatggaata gaatagagcc ctcggaaata ataccacaca tccacaacca 1440 

tctgatcttt gacaaacctg acaaaaacaa gaaatgggga aaggattccc tatttaccaa 1500 

atggtgctgg gaaaactggc tagccatatg tagaaaactg aaactggatc cgttccttac 1560 

accttataca aaaattaatt caacatggat taaagactta aacgttagac ctaaaaccat 1620 

aaaaacccta gaagaaaacc taggcattac cattcaggac ataggcatgg gcaaggactt 168 0 

catgtctaaa acaccaaaag caatggcaac aaaagccaaa attgacaaat gggatctaat 1740 

taaactaaag agcttctgca cagcaaaaga aactaccatc agagtgaaca ggcaacctac 1800 

agaatgggag aagattttta caatctaccc atctgacaaa gggctaatac ccagaatcta i860 

caaagaactt aaacaaaatt tacaagaaaa aatcaaacaa ccccatcaaa aagtgggcaa 192 0 

aggatataaa cagacatttc tcaaaagaag acatttatgc agccaacaga cacatgaaaa 198 0 

aatgttcatc atcactggtc atcagagaaa tgcaaaacaa aatcacaata agatacatct 2040 

cacaccagtt agaatggcga tcattaaaaa gtcaggaaac aacagggaca tggatgaagc 2100 

tggaaaccat cattctgagc aaactattgc aaggacagaa aaccaagcac cgtatcttct 2160 

cactcatagg tgggaattga acaatgagaa cacttggaca caggttgagg aacatcacac 2220 

gctggggcct atcgtgggtg tgatctgtag aaaagtattc cctggaaatt cagggccttc 22 80 

aaaacccagt gggcttcact tctcccagcc cctaccccaa gttacttcag ttgtggcaaa 2340 

aataacaatt gtcccctggg aaatgaagct cattgctatg ggtgttcaag atgaactcaa 24 00 

cattgccttc cacaaaaatc acttactgat gaatgacacc accattcata tgacacccta 2460 

catccagcca gcccccaagt cctga 2485 



<210> 223 

<211> 2086 

<212> DNA 

<213> Homo sapiens 



<400> 223 

atgagggtgt gcgcgcgcgc tcggcgagcc cgggttcttt gtgtgtccga gcccatgcgc 60 

gcgatgggtg atgctgagag ctgcctgctc agacaacaga cacgcgaggt caggaagaag 120 

ccgcttataa attaccgctt ccttcgcgcc gccgccaacg ccgagccccg aggaccgcaa 180 

gcccagagga caagctgcgc caagagggag tgcggagcgt tcacccagcg ggggtcagcc 240 

ctccacaagg cagtccgggc tctggggact ctgttttcta acctgaactt gtccgcctgt 300 

cagtttgtca aaggatttgg gaggtgtact tctctgaaag tctttccaaa agtcctgaat 360 

gataagaaac ttatccagaa ccagaaccac agcccttctg aggagctccc aaacccctgc 420 

cggcttcgtg acctggagga gtcacagaac taccctgagc ctcaggtgag aaaactgggg 480 

ctcaggaagg cagtgagaaa ctggaagtca gccacgtgtc aggacaggag cctggattta 540 

gcccacatct ttggctttgg ttttgcacac agctgctgct gggagctggg gccagggcag 600 

gcagaaataa tcctgggctc tgcagagcct gtttcaggct catggtctac cctaccccgc 660 

ccccatttcc ggcaaaacca ggctgagggc tgggctgcta gcgaagctga cagcaagagt 72 0 

gacagccctg tcctgtcgct gcaacaggaa aacaacccca aaaccccagc cttaagagca 780 

agtccaacag tgcgcgactc cccaggtcag gagaagcccc gatgtgtgca gaatctgcag 840 

atggggctgc tctcggccct gcctctggct gtggaaacag gcctccaagt acccaagagc 900 

cgagcttctg ggtttgaatc cagctgtatg accctgggca agtggcttta cttctctggg 960 

cctctttcca tcatccgggc tggtgaggaa attcactttc actacgacca gaatattttt 102 0 

gctcattgtt tcggctttgt tccgggcaaa acaatgcctg gcttcctgga tgacctcacc 1080 

gggacaagcc atcagggctg taggccagcc tggtcagcat ctccgcacga gcccattctt 1140 

ccctttgcca cggccacctt agtcaagccc ctgcttgtca ccggcctgcc tgggaccctt 1200 

ggagcagccc tcctgtccca cgactttacc ccaaagagct ctctagtctg cacccctgtc 1260 

tctgattcat cacccaacat ccatcaggtt ggcagctgta acaaacagtc ccaggtctca 1320 

gggctcaaca caaggaagag gcagccgctg ctgctggttg tcgtgtgtct ggtccagaga 1380 
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cactctaaac atcgtttaca caaatgggag cgtgtgatgc tgcccgcacc ccagattccc 1440 

ttacatggag gccccgctgg ctatttcgca acacccactg gcattcactt agggcgctct 1500 

gatctctcgc tgaagcaaac catgctggtg aactcccttg cctgtgtgtc atttcacatg 156 0 

tcggaaaaat cacctgaaag gaatggaaaa cttatgccta cacaaaaagt gcatgtggat 162 0 

gtctacagta gctttttcca taattgccaa aacctggaag caaccaagat gtccttcagc 1680 

agcgccaagc tggaaacaac ccacacatct aacaacagga gactgggcaa atggcctggg 1740 

gtagttttct acaagggaat atgtgacagc atgagagcga acgaaccaga gcaaaccaga 1800 

cacaaccacc tgggtgaatc tcacagatgt gcagttgagc aacagcagcc aagcaccaag 1860 

gggtgtttcc actactctgc agttcccagt gttggggaca gtggacaagt ctggtgttac 1920 

catgaccttt tgggtttaaa atggcagcat agcccagtgg tcaagggcat agcttggtgc 1980 

cagctgtgtc ctctcagtga gtcctcaact tcctcgttgc ttcagctgga agatgcagag 2040 

gccaaagttc agcaagccga gggacctgcc ttgggtctcc cagctg 2086 



<210> 224 

<211> 942 

<212> DNA 

<213> Homo sapiens 



<400> 224 

atgggcaata agacttatgg aggacaaaat cagatgttga tttttgcttt cactttgcat 60 

tccctctttc ttaattctgg agatggcaga ttgtcctttg agagctcgtc ccagaagcca 120 

gggggctttc gcaacattgc aatccaaact tcccccagtc tcaggaagca tttcccagtt 180 

ttcaaaagga agagactcac agccagtaag tccgtggagg aaatgccaac agcctcccaa 240 

agtgccatcc atgtcaacgg taacctctct gaacaggaca ttgtgtcttc tgaccttgcc 300 

tacttaaggt tggctcagca tcttgaggat gggcctcgaa gggtcaaggt gtcccatgca 360 

ttcctcccaa gggtccccaa ggtgcaaagc aatggtcctg ttagcatatg cttggaagca 420 

ggaacttgga ggtccttaga gaaagccaca gctgccattc aggttccaga tgatatttat 48 0 

cacagtcctt cctgggaagc tagagagtct gctctcagcc cagacaggtc agctgaacat 540 

aacagcctca gcaggccatc tgaccctggg ctgtctctcc agcctcagct tctgcccact 600 

ctttgtctcc cgttccatgt actctacacc aggagtcccc agtccctggg ccacgggccc 660 

atagcagtcc atggcctgtt gggaaccatg ctgcgcagca ggaggacatg gagctttctg 72 0 

tatcccgggt tcttgccttg gtgttctgga agaatcggat cacgtgtggg cttggagaat 78 0 

gagtgcaagg tttcattgag tggaagtagc tctcagccaa tgggagagcc agaagggaga 840 

tggtcttccc ctgaagttgg accactcgcc agccccggct ctcccctgat agcctgggcc 900 

aaactccgct ttgtcccgcc tgtcgatgac ctgccagtgt ga 942 



<210> 225 

<211> 1528 

<212> DNA 

<213> Homo sapiens 



<400> 225 

atggacagtc aagaagtaga gaaatacccc aacacttctg tagcttgtga ggaaatcccc 60 

ttttcaggaa tccatgttgc tggtgggaaa tcaggagcat tagaacatgg aaaggatgac 120 

cttgatgaac ccattgaaaa tcccctcttt tgtttctcaa gctttagcaa tgcacttgca 180 

atattgcttc caaaagtgtt ccttaagaat attcatatac tgcaattcat ctacaggagt 240 

tttcaccttt tgacaatggc aaaagcaaag tttgaaggtg ctgagtctgt ggagccagtg 300 

tcaccttcac agcccaaaag gccatcctat gtccccctag aagagctatg gacgaggtta 360 

acaaaaggga acagcaggcc tcagcagagg gacagggaga agggaggatg gatgaaggga 420 

gtgcagcaag gccatcaagg agtaggaaag caggaggaag gttcagagaa catcaaagaa 480 

aaagctggga ttgttgtctg tgaggtgcct aacaacaagt tagataaatt catgggaatc 540 

ctttcttgga aagacagcaa gcattccctc aacaatgaga agataatcct gagaggctgc 600 

atcctgagaa ataccagctg gtgttttgga atggttattt ttgcaggtcc tgacactaaa 660 

ctaatgcaga atagtggtaa gacaaagttt aaaaggacaa gcattgatag attgatgaat 72 0 

actctagtac tatggattat gctaatttct caaccagtgg tagaatttat catgagagga 780 

cacagttatt ttataaactg ggaccggaag atgtattatt ctcgaaaagc aatacctgca 840 

gtggctcgaa cgaccacgct caatgaggaa ctggggcaga ttgagtacat tttctccgac 900 

aaaacgggta ccctcactca aaacatcatg acctttaaaa gatgttccat taatgggaga 960 
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atctatggtg aagtacatga tgacctggat cagaagacag aaataactca gttaatacat 1020 

agatggctag ctagactaaa aaagaagaag agagagaaga atcaaacaga cacaataaaa 1080 

aatgataaag ggaatatcac cactgacctc gcagaaacac aaactaccat cagagaatac 1140 

tataaacacc tctacactaa taaactagaa aacctagaag aaatggataa attcctggat 1200 

gcatacacct tatcaagact aaaccaggaa gaagttgaat ccttgagtag accaataaca 1260 

agctctgaaa ttgaggcagt aattaatagc ctaccaacca aaaaaagccc aggaccagac 1320 

agattcacag ctgaattgta ccagaaatac aaggaggagc tggaaaaaga gcctgtggat 1380 

ttctcagtca aatctcaagc ggatagagaa tttcagttct ttgaccacaa tctgatggaa 144 0 

tccattaaaa tgggtgatcc caaagttcat gaattcctta ggttacttgc tctctgccac 1500 

actgtaatgt cagaagagaa tagcgcag 1528 



<210> 226 
<211> 515 
<212> DNA 
<213> Homo sapiens 



<400> 226 

attcccgggt cgacgatttc gtctccttcc cccttccgga agctatagac acttccgaga 6 0 

gagcgcgaac ttgcggagca ctttgcatgc gcgccgcggg taggactgga cggtcggatt 120 

tgtaggcaat tctgacaccg agctcccggg ctcggttggc -agaagatcgc tttgggaatc 180 

ctcctactct acgaggacca gaaaccaagg acgccaagcc atacaaattc agcacagctg 240 

actcagagag gtggaaagga aatcaggaca aaaagccaca atgtcttctg gaggaggata 300 

ttgccagcca gaataactgt gagagatgca agcatagtcc tggaaacaaa cagcacaatt 360 

cctttaaagc cctaaaggat gaagatcttc ctgtagagaa gtatttaatg gaaaggcagc 420 

ctgtaggtga gccagctgcc gaccaggtag ctatggatgt gatgcacagc cccatcctcc 480 

aactggaaag gaaacacaaa gtctcaagtg acagc 515 



<210> 227 
<211> 1023 
<212> DHA 

<213> Homo sapiens 
<220> 

<22l> misc_f eature 
<222> (1) . . . (1023) 
<223> n = a,t,c or g 



<400> 227 

gtagtgtaca ttctttttca aatggaagag ctggttatag ccaacaacat cagcatcgac 60 

atagtggaca tgaaagagaa gaggaaagag gagattggaa ctgggcaaac tattaaaatg 120 

caaacagaaa acttgggtgt tgtttattat gtcaacaagg acttcaaaaa tgaatataaa 180 

ggaatgttat tacaaaaggt agaaaagagt gtggaggaag attatgtgac taatattcga 240 

aataactgct ggaaagaaag acaacaaang gcaattaggc aggagaagga aataaagggt 300 

attcaattag gaaaagagga agtcaaattg tccctgtttg cagatgacat gattttatat 360 

ctagaaaacc ccattgtctc agcccaaaat ctccttaagc tgataagcaa cttcagcaaa 420 

gtctcaggat acaaaatcaa tgtacaaaaa tcacaagcat tcttatacac taacaataga 480 

caaacagaga gccaaatcat gagtgaactc ccattcacaa ttgcttcaaa gagaataaaa 540 

tacctaggaa tccaacttac aagggabgtg aaggacctct tcaaggagaa ctacaaacta 600 

ctgctcaagg aaataaaaga ggatagaaac aaatggaaga acattccatg ctcatgggta 660 

ggaagaatca atatggtgaa aatggccata ctgcccaagg taatttacgg attcaatgcc 720 

atccccatca agctaccaat gattttcttc acagaattgg aaaaaactac ttcaaagttc 780 

atatggaacc aaaaaagagc ccacatcgcc aagtcaatcc taagccaaaa gaacaaagct 840 

ggaggcatca caccacctga cttcaaactg tactacaagg ctacagtaac caaaacagca 900 

tggactagaa aaatttattc agctaagaaa cgtaaagtca aaatttcagt ggaacctgtt 960 

tacagcggtg tgacactaac tacagcaata cagcttgttc ctcttctgtg cacagctctg 1020 

taa 1023 
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<210> 228 
<211> 936 
<212> DNA 

<213> Homo sapiens 



<400> 228 

atggactacg agaaagcaga caagagacca actccttggg aagcagcagc aaagtctcct 60 

ctcggtctag tggatgatgc tttccaaccc aaaaacatcc aggaatccat tgtggcaaat 120 

gtggtttcag cagctcggag gaaggtgctt ccagggcctc cagaggattg gaatgaaaga 180 

ctgtcctata ttcctcaaac ccagaaggcc tatatgggct catgtggaag gcaagagtat 240 

aatgtcacag ccaataataa tatgtccacc acctcccaat atggttcaca gttgccatat 300 

gcatattata ggcaggcttc aagaaatgat tctgcaatca tgtccatgga aaccaggcac 360 

ttgtacactc gccagcttta ctgttacagc tttggagact ctggaaactt ctgtgaaaat 42 0 

acaaatggca gacctgcagc agatgctgtc aggggcctga ctatcctctc actctccacc 480 

acttccatac catccagtgg aataagtgag gctttgatat ctgaaaatga aaacaaaaac 540 

ctcgagcatc tcacacatgg gggttatgtg gaaagtacca ccctgcagat tcgaccggcc 600 

acaaagaccc agtgcacaga attcttcctc gcccctgtca agactgaagt tcccctagct 660 

gagaaccaaa gaagtggtcc cgactgtgca ggcagcttga aagaagaaac aggcccgagc 720 

taccaaaggg ctccccaaat gcctgactcc caaagaggac gcgtggcaga agagctgatc 780 

ttaagggaga aagtagaagc gagtactcag aacaattact atgtaggtga gctgacaggt 840 

gtaaccttac aaaatggtta tggagaaaaa cccatccttg ctactcaagg tgaggctcac 900 

agaccatcag caatggaatc acctggaggc ttgtga 936 



<210> 229 

<211> 1448 

<212> DNA 

<213> Homo sapiens 



<400> 229 

ttaactttgc aataatatgc agccagctaa ttgaaaattc tagtgccata aatgttttta 60 

attgcttact atgtgtcagg cgttgtacta ggcactgggg aaaactgtaa ataaaagaga 120 

cccagttcct tccttttttt caggttgcag tctatcagga aaggcagaca tttaaaaagg 180 

gtcttgtgag gactgttatg aaacaggaag tacgagtgcc atttctatac cttagtacaa 240 

tcactcgttt taattcacat cttccatggt ggcaagtttg cctcagtgtc agccggtcaa 300 

ttgccactca aggggatcag aacatttatg tatctggaat gcttgaatat ttaagagaaa 360 

acctttttgg ccgctttgat aatgataatt tttgtctatt gaatggtgat gctgtgattt 420 

tcaggatgta tgtatcatgg aaactggtag agaaagaaag aactgagatc atgctgaagt 480 

atactggggc ccaccaagag gtagagctga gtgcaccaat tgtcaccaaa atggcaaccc 540 

aatatttaag agaaaacctt tttggccgct ttgataatga taatttttgt ctattgaatg 600 

gtgatgctgt gattttcagg atgtatgtat catggaaact ggtagagaaa gaaagaactg 660 

agatcatgct gaagtacact ggggcccacc aagagacttg gttaaaagac cttgaggaat 720 

ccccccttta cgaagcctta tccatgagag gacaagataa ggagaccctt ggtttgtgga 780 

ttcagcttcc atggtgccct tggggtaaag ctgtccagat gcacatgaac ccctcctctt 840 

ttcagctgga caccaagcca ggaaaaggag agcttgctgg ccgccttatc attccacacc 900 

aggaagccag catccttgag ttgtccttac ttcttatgac ctgctgtgta gaaagagagg 960 

ggaaaaccag tgtccgagtg gctgctgtgg gtgagtgcac agccagcgag acacccaatc 1020 

agggagcagg aaggctgtcc ctgtggcagc agttaaccag taagaaagag accataatgg 1080 

agaaagaaca cactgactgt gtttcacaga ctgttgccct catctccact tgtgttaaag 1140 

aaggagggtc ccgacctgcg gataaagacc tagaggaagg gggaggcctt gaggcggagt 1200 

cccccaagca gagccccaac ttgtgtgtga tcctgcgcca caacttggcc agccggcctg 1260 

gacagctggc tctggtfcact gtgggaacaa tgcaaggaag gccgttgtca cattcctctg 1320 

aggtcaaagg cacaaccttc gtcacacact cagtccctgc tggcaaagag aaagacgagg 1380 

agcgtggaat cggagacctg gagcatgcga gggacttgag aaattcacca actcccttgt 1440 

tttactga 1448 



<210> 230 
<211> 906 
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<212> DNA 

<213> Homo sapiens 



<400> 230 

atgagtgaac tcccattcac aattgcttcc aagagaataa aatacctagg aatccaactt 60 

acaagggatg tgaaggactt cttcaaggag aactacaaac cactgctcaa tgaaataaaa 120 

gaggaaacaa acaaatggaa gaacattcca tgctcatggg taggaagaat caatatcgtg 180 

aaaatggcca tactgcccca gaggttgcct catggttttc tgccaaatat gaaacttgaa 240 

gttgtggata aacggaaccc caggttaatt cgtgttgcta cgattgtaga tgttgatgac 300 

caaagagtaa agcactcaat gacagccagc tcagggtctg gggtttcggc agatctcaac 360 

acagccagcc agcccctatg gcttctgaaa actgcactgg ctgtgtcttc ctctgtcaaa 420 

gtccatcccc ctgtgtcagg tttgatcttc agctcctcca gaaccctgct ttcctttatg 480 

ggaattatga gagaagactt agggttttcc aggaggcaga tcctccattt ccccatggct 540 

ttgtccaaga gtgcaggaag aagatctaag attggtcaat tggatgctct ttcccaggac 600 

tttggcttga gggacagaga ctctagcaag aagggaacag gttatcccaa tccagaaaat 660 

ttttcctgga cagaatacct ggaagctact caaaccaatg cagttcctgc caaagttttt 720 

aaaatggaca gtgatgttgg ggagaacaga aagattctcc gggatgaaag acctaattac 780 

agtcagtata ctccatttag tagatgtgat aatgcatctt acaaagaaaa tgtgtttcta 840 

caaaagctgg agagaaacac accagatatt gcagaaagat ttgactgttt attactgaca 900 

tattag 906 



<210> 231 

<211> 1117 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (1) . . . (1117) 
<223> n = a, t, c or g 



<400> 231 

ccatggaata gatgatgagt tggtcctctg cagagaaggg acccgaaggg cacaggcgaa 60 

gggagtggcc aatgccagtg ggaagatgag ccatgaaacc agaatgggga gtccagaaaa 120 

aaagagagaa aagagaaaag aagaaaagag acatgagagg agaggagagg tgagaaaaga 180 

gagaagagag aaaagaaaag gagaagggag ggaaggatgt ccatatcttt tgcgaggagg 240 

tacagcatac tctagagcac gggtccccag tccccaggcc acgaaccggt acagctctga 300 

agcaataact gccctccaac tgtttaatga gactcaagct cttatgcaag ctatagaatt 360 

gtccaaaggc ctccggagag aagaaaccag atgcatgcag tctggtacca acacagcatc 420 

caatgcaaca ctgttatact gtgcatcaga tcttgtgact cttgtggcag acattaccac 480 

ttgccatgga tatccaagtt cccaatcctt gcaggaactc tgggaaaaag catcaccctc 540 

aaatgtggac aagataatga gggccacaga gaacacagat gtgctgggtg ctccaaaagt 600 

atgtgtgttt gtgattcaga atatacacaa tgggccagga gcagtggctt acgctggtaa 660 

tcagagcttt gggaagcgaa gtttatcccc acataattta aggagtaatg ctctcctggc 720 

agtggactca gaacctaaat tcccccttct taagccactt gagttttgcc ctggagtaag 780 

catgacatca gggtgcttgc tgcttcctcc agtggcccga gctctgtgtg cctttgttga 840 

acaaggtttt cgtggagctg aggtgactct catgaacaga gaatacctgt ccttcataca 900 

gagaagcacc caaatgcaaa catctacttt ttctggtgtt caaccnagaa ttcttggagg 960 

aggtagactg caggggatgg gctggttgac gattgacttt tctcctcaag catctaaacg 102 0 

tgccctaacg tcttccatta tggagctggg gccaggtgtc agcaaactgg ttcagtggca 1080 

gctggagaac ctaaaaaggc attgtgggcc tctgtag 1117 



<210> 232 

<211> 4707 

<212> DNA 

<213> Homo sapiens 
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<400> 232 

tcacaggcac ctgctgctgc tgctgctgcc tctgctcttg ccctcagtcc ccgtctttcc 60 

acctgggtcc ccttgcactt tcatgcctga ggctgcactg gtggccaagt ctaaactgag 120 

ggtcctcccg cagaccgaga gccgccgcag cgcctcggag gcggcctcca cgggcatgct 180 

ggagcgctgt tccagcagca tctccagcag tgagctcatc tccgagacaa aggtgctggc 240 

cagcctcgtg cctgttgggc tcagctctgg tgcctctgcc ttgccgaacg tctggaaggt 300 

ccttggctcc tccgtggctg cagcatccgg ggagatcaca ttgtcacggt agttggtggt 360 

gaggggcaaa gagagtctcg ccatccaggc cgggtcaggg gcgctcagcc ggtccagggc 42 0 

cagacctgtg ctggggtcca gcagacttga cagctcttct tctagcagtt gcttcacaga 480 

gaggtcctct tcaggatcca aaggcccttc ctcctgttct gggtctgtct ggcctccagc 540 

ccttgcactt gggccatctg tgtctgggat tgcactcctg aggggtttta aggcgtagga 600 

gtccttctgt ttgtttggtt ttgttcccct gtagccattt ctgctgctgc tgcctcctgg 660 

cttggccaag tacttatttc ctcggtggtt tggtttgggc tggaattggc cctgatgcag 720 

caaggacagc agctgggaga tttgctgaac aggaggagaa tccacagtga gctcctccac 780 

ggggttccgc tcggcgaagg cagccacaga cagccggacc aggctccgca ggatctgacg 840 

gggccctgat tctttctcag gggacacttt gccattgaga tgtcgctgcc gtctcagggt 900 

tgcagaggag gctggtggcc tctgtggggc ttcctcactg ccctggtctc cagccagcct 960 

ccctgtgggg ctgcctgcaa ccttcagagg cctggaacgt ggctggcctg tggcaggctg 102 0 

gggctcggga aggttcaggt tctcccggga ggcattcctc tgcctgggat ggttgaaaag 1080 

gaggttgacc gtgtcttgca gcacctctcg gctctccgcc ggtgctccct ggttgccttg 1140 

attacgcagc gtcctgtaca gggtcggggt gaggtggaag ggggcctgca ggcaggggtc 1200 

ccagcctgct tccatcatcg cctccttgtc cacatctttg tgggactgcc cgacttcaca 1260 

aggctcacct gcctgacccc tgagcacagg cacgaggtgg atgtctgcct tctgaatgtg 1320 

tttctggggc ctcttgggct gctggcggta ggtggactcg gcctcccgac agttgtaggc 1380 

cctgttgtcc ttcttttctg tccggcagat ggacatgaac aaagccagga tcaacccgaa 1440 

gatgcccaac agtacagcca ggcagatcac cgtcagcatc gacatgctca aggccccagg 1500 

cttgcgggct gagtccctca ggtggtccac actggtgaca aacatgaccc tcaacagggc 1560 

tcgggtctgt aagggggggc ttccctggtc ctctactact atctccagct cccactcact 162 0 

cccaatgagg ctgctggcat tggtgacatt gacgaacagc tgccccgtat gagggttgag 1680 

gatgaagagg tgggcttcat ttccactgcg gatgctgtag aggggctctc catttgcccc 1740 

cgagtctgca tctcttgcca caatggttgt caaaaggaat ggccgggagc tgtgagtggc 1800 

cagtggaggt gtgtcagtgc ccgctgggcc caagccattg ggagtctcga tgggcaccag 1860 

caggtggcct gtggaggcat tcacaagcac ggagaggctg gcttttccat cgctgagcac 192 0 

aggctggacc acctctgggg cattatcatt ggcatccaag aggctgaccc acacagagac 198 0 

actggatgca agcatgggtt gcccgctgtc ctctgcgatc acctggaact caaagccggc 204 0 

catctcttca tagttcagtg acctctgagc agtgacctct cctgtgttgg agtcaatagc 2100 

tactaagtga gcaactgggg agtcctggat gcggtatgag acttttccat taatgcccaa 2160 

gtctgcatca tgagccttga tggtaatgag gtgaagagag ggtaagttgt tttcccgcgt 222 0 

ggagacttca tacctgcttt tctcaaacac aggtgcattg tcgttgatgt cactgatctg 228 0 

aatgctgagc tgtttcttgg ctgataaggg ctggagtcct tggtcttggg ctaacagagt 2340 

gagggtatat ttgggccact gctctctgtc cagtgtggca ttggttagca acatgtatgt 2400 

gttgccatta gttcttttca gcctgaagtg gcccagctct tggctcagcc agcagtggac 2460 

caaaccattg tgtcctgaat ccaagtcatc tgccatgaca agagcaataa aactgtcctt 252 0 

gggaagagct tctgacacca gtgatggctg ggaggcccat gtgacgtgga tgcttgggat 2580 

gttgtcattg acatccagaa ccttgatgag aactttgcaa tgggctggga taggattggg 2640' 

acccaggtcc cttgcctgaa catccacctc gtaggcaggg ttcttttcat agtctagagg 2700 

tcgacgcaga atgacctggc ctgtcttggc atcaatactg aaggtgtcca gcacctctgg 2760 

aggcatgtgc ttactgagga agaactccac ctccccattg gggccttggt cagggtctgt 282 0 

ggcggtcagt tttatgagaa gcgtaccagg tgcagcatct tcttggattt ccagtgccag 2880 

tgaactctca gcaaacgcag ggctattgtc attggagtcc aagacgttga ccttgaccaa 2940 

gctggtacct gacttggggg ggttcccatt gtcataggca gttaacacca gatcaaaaaa 3000 

tgaatggatt tccctgtcca gctccttcac cactatgagt tctgcatgtt tggtctcatc 3060 

agggcccaca atgacatcca aggcaaagtg ctcactgctg ctgctgctgc tgcctctgct 3120 

cttgccctca gtccccgtct ttccacctgg gtccccttgc actttcatgc ctgaggctgc 3180 

actggtggcc aagtctaaac tgagggtcct cccgcagacc gagagccgcc gcagcgcctc 3240 

ggaggcggcc tccacgggca tgctggagcg ctgttccagc agcatctcca gcagtgagct 3300 

catctccgag acaaaggtgc tggccagcct cgtgcctgtt gggctcagct ctggtgcctc 3360 

tgccttgccg aacgtctgga aggtcctcgg ctcctccgtg gctgcagcat ccggggagat 3420 

cacattgtca cggtagttgg tggtgagggg caaagagagt ctcgccatcc aggccgggtc 3480 

aggggcgctc agccggtcca gggccagacc tgcctgaccc ctgagcacag gcacgaggtg 3540 

gatgtctgcc ttctgaatgt gtttctgggg cctcttgggc tgctggcggt aggtggactc 3600 

ggcctcccga cagttgtagg ccctgttgtc cttcttttct gtccggcaga tggacatgaa 3660 

caaagccagg atcaacccga agatgcccaa cagtacagcc aggcagatca ccgtcagcat 3720 

cgacatgctc aaggccccag gcttgcgggc tgagtccctc aggtggtcca cactggtgac 3780 
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aaacatgacc ctcaacaggg ctcgggtctg taaggggggg cttccctggt cctctactac 3840 

tatctccagc tcccactcac tcccaatgag gctgctggca ttggtgacat tgacgaacag 3900 

ctgccccgta tgagggttga ggatgaagag gtgggcttca tttccattgc ggatgctgta 3960 

gaggggctct ccatttgccc ccgagtctgc atctcttgcc acaatggttg tcaaaaggaa 4020 

tggccgggag ctgtgagtgg ccagtggagg tgtgtcagtg cccgctgggc ccaagccatt 4080 

gggagtctcg atgggcacca gcaggtggcc tgtggaggca ttcacaagca cggagaggct 414 0 

ggcttttcca tcgctgagca caggctggac cacctctggg gcattatcat tggcatccaa 4200 

gaggctgacc cacacagaga cactggatgc aagcatgggt tgcccgctgt cctctgcgat 4260 

cacctggaac tcaaagccgg ccatctcttc atagttcagt gacctctgag cagtgacctc 432 0 

tcctgtgttg gagtcaatag ctactaagtg agcaactggg gagtcctgga tgcggtatga 438 0 

gacttttcca ttaatgccca agtctgcatc atgagcttga tggtaatgag gtgaagagag 4440 

ggtaagttgt tttcccgcgt ggaacttcat acctgcttgt cttttttttt gtctttctcg 4500 

tgcgttggtg tgaccccccc ccacttcctt cccgtacggg gtgtctgccg gtcggagacg 4560 

ctcctcactt cccagatggg ttgttgctag agcaggtgat tcaatcttcc ttaggtaatg 4620 

ccgtggtcca aaggcgatcc cacgcagcac cacattaaga cgcgtttctg ccacaggcac 4680 

aggttcgggt tgttttacct gttccat 4707 



<210> 233 
<211> 1287 
<212> DNA 

■<213> Homo sapiens 



<400> 233 

gcgtcgaccc acgcgtccgc aggaggaagg tgggtctgaa tctagcacca tgacggaact 60 

agagacagcc atgggtttga tcatagacgt cttttcccga tattcgggca gcgagggcag 12 0 

cacgcagacc ctgaccaagg gggagctcaa ggtgctgatg gagaaggagc taccaggctt 18 0 

cctgcagctc tctggcccag ggctgggtca tcagcacacc ctgctgctgc tgttcagatc 240 

tgcatcctgg tcccgcttgg tcccacagtg agaacgcttt gctatcacat gggcaggctc 300 

tgagagccct gccggcctgg ccttctcaaa gaagacctga gagcttggga cccaagcaga 36 0 

gaggaagaac agggctcagg gtgcttgctc catgctcgct ccacacctgg ggctcaaccc 42 0 

tggctttccc cggctccctg tgtgacttca gggcaggtcc cttgggccct ctgggcctta 480 

tcatcttcat ctgtaacagg gcgatgcctc tgccgtgtct ggtggtgttg aggagttcct 540 

gtttgtgtaa gcagctagtt cagtgccagc acgagatggg aggcccatga agttagcagt 600 

gcacaaaaaa tagagcaaag actggatgca tttcctgaga acaaccatca ctgtaaagca 660 

ctttacaaat ccaaagacaa cccccggcaa aaactcaaaa tgaaactccc tctcgcagag 720 

cacaattcca attcgctcta aaaacattac aagttagttc atgtcatgcc agatagctga 780 

aggcagcttc acaagttctt aaggccagga atgccatgtg tctgctatgc acagctgggc 840 

cctggccctt gaggcctgga atggacaggc aaaggttgac ggcagatgtt gggtgccctg 900 

cttccttgcc ccagcagcag tgcttggtgg aggctgaggc cctgcacagg caccctcact 960 

gctgaccttg agcctctctc tcctctaaag tgggaaaaac aagggaatcc gtggataaaa 102 0 

atggctcaag gacctgggac gccaatggag atgcccaggt gggacttcag tgagttcatc 1080 

gtgtttcgtg ggctgcaatc acgttttgcc tgttcacaag tattttgaga aggcaggact 1140 

caaatgatgc cctggagatg tcacagattc ctgggcagag ccatggtccc aggcttccca 1200 

aaagtgtttg ttggcaatta ttcccctagg ctgagcctgc tcatgtacct ctgattaata 1260 

aatgcttatg aaatgaaaaa aaaaaaa 1287 



<210> 234 

<211> 2640 

<212> DNA 

<213> Homo sapiens 



<400> 234 

^SS^-SS^S 9 caa STCf a 9£f a gcaagtcaca tcttacgtgg atgtccagag agcttgtgca 60 

ggtattcgag gtgcctttga aaagccccag ggtgctgtgg ccagggttca cattggccaa 120 

gtcatcatgt ccatctgcac caagctgcag aacaaggagc atgtgattga ggccctatgc 180 

aaggccaact ttaagttccc tggccgccag aatatccact tctcagagaa gtgggacttt 240 

accaagttca gtgtggatga atttgaagac atgatggctg agaagcagct tatcccaata 300 

actgtggggt caagtatacc cccaatcgtg accctccgga caagcgggat ggagtcgccc 360 
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tgcaacacgg gctgctccta tggcaactgc tacaaaacaa gatcaggctt caccagggtc 420 

gggagaagaa acctcccaag aaagccaggc gctagagtca tgggtgtgga tgtcctttta 480 

gggcctagga aagtagggtc tgatgaggag gcattgaaga aaagccagta tgacgacgat 540 

gctttcatgt actcccccat ctgccaccaa ccacctatcc gtggtttcgc caccacccac 600 

ctatccgtgg tttctccacc accctgcttc ctgggatgca gcaaccccca aacagatgca 660 

gttttgggcc agcgttatgg aagagaaagc aaatccccca aatgtctgaa tcacaaacac 720 

agagacctca cccccgtgac agtggaaaca aatgccctgg cctctgaccc ctacagatcc 780 

gcgttcctgc ccctaaacgg tgcagtgagg ggagagctgg gctgcagaac aggtggagag 840 

aagaacgtga agagcaaaag agagaagagg ggtgttaccc agagggtaga agctgcagca 900 

gtccccgggc tggagctgct gaatgtctgg agacgtcccc actgctgggg agcggaagcc 960 

aatcctgtgt ctccgctgga gcctcagatg cttctcgggc cccctgttgt atacagtcac 1020 

tggtaccgct gccgctggag ccagaatttc gacatgacag ctcgccagtt tcacaatgca 1080 

taccttcctt atcagagacc gttgatgatg gagatgctct ggccagtcta tagagaatgc 1140 

acagaggatt ttcggtcctc tgcttcacct tctgatgtca gaaggctgaa aactccacac 1200 

tcagattatg ctaatgctgc catttgtgta catgtgaccc atgaagagca actctcctct 1260 

ggcgtgacag ccggcatgga ggatccacag agtaaagagc ctgccggcga ggccgtggct 1320 

ctcgcgctgc tggagtcgcc gcggccggag ggcggggagg agccgccgcg tcccagtccc 13 80 

gaggtgagga gtcgacccgc gcctggctgt tggcgccagc tgccccgtcc ccaaagcgcc 1440 

ctcggcaccc ctttcttctc aggtgtcccg cagaaggtgc agctgtgcgc ccttggaccc 1500 

cagcctcttc ctcggtgccc agctccctgg ctttatcctc gggaaactca acagtgtaaa 1560 

tttgatggcc aggagacaaa aggatccaag ttcattacct ccagtgcgag tgacttcagt 1620 

gacccggttt acaaagagat tgccattacg aatggctgta ttaatagaat gagtaaggaa 1680 

gaactcagag ctaagctttc agaattcaag cttgaaacta gaggagtaaa ggatgttcta 1740 

aagaagagac tgaaaaacta ttataagaag cagaagctga tgctgaaaga gagcaatttt 1800 

gctgacagtt attatgacta catttgtatt attgactttg aagccacttg tgaagaagga 1860 

aacccacctg agtttgtaca tgaaataatt gaatttccgg ttgttttact gaatacgcat 1920 

actttagaaa tagaagacac gtttcagcag tatgtaagac cagagattaa cacacagctg 1980 

tctgatttct gcatcagtct aactggaatt actcaggatc aggtagacag agctgatacc 2040 

ttccctcagg tactaaaaaa agtaattgac tggatgaaat tgaaggaatt aggaacaaag 2100 

tataaatact cacttttaac agatggttct tgggatatga gtaagttctt gaacattcag 216 0 

tgtcaactca gcaggctcaa ataccctcct tttgcgaaaa agtggatcaa tattcggaag 2220 

tcatatggaa atttttacaa gggggaacat ccgcatccaa gagacgcctg cagtccaggg 2280 

ggctccagag ggagggagtc ctgtggatgt ggaagatacg cttttgtctg agagtcatga 2340 

gattggactg tgggtccctg ttgttgttaa cccctgtgtg atcttgggca attcactgtt 2400 

cctgtcttta cctctacttg ctcctcagta agtcagagtg aactgagatc tcttcctgaa 2460 

attacttagc ctaattctat ggttggtttc tggtggccag gaaccagaaa taaatctctc 2520 

agcaaaattt tgcttatgga ccatgaatct tagagtatcc atggtatcac tgatatcttt 2580 

aattcccaga tattttccag taatgaaatt catataataa agccaggatc aatggttctt 2640 



<210> 235 

<211> 426 

<212> DNA 

<213> Homo sapiens 



<400> 235 

atgtctcgcc gcaagcaagg caaaccccag cacttaagca aacgggaatt ctcgcctcgg 60 

gaccgcgagg aggtcaccac gtgctttccc tgcccacccc ccaccccgcc cggtctggtc 120 

accagtcccc ctgcgccccg agcacgacta ggccagcctt gttcagccag aaatgaaaac 180 

ctcctcgaag cagattatga ccccccagaa cccatcgtcc taagaaacac cactgccacg 240 

cacactcact cacactcagt ctctccctcc ctgtataatt ctgattctcc ccagcctttg 300 

aagcatcttg gagccgtgtc agctgcagag actggagtcc ggggcatgat ggggatgtac 360 

ctgaaaccta gtgagagtcc attcctgact ctggggaaga gccgctcaaa gatggggatc 420 

ctctag 426 



<210> 236 

<211> 3204 

<212> DNA 

<213> Homo sapiens 
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<400> 236 

atgtgcgaac tcgatatttt. acacgactct ctttaccaat tctgccccga attacactta 60 

aaacgactca acagcttaac gttggcttgc cacgccttac ttgactgtaa aactctcact 120 

cttaccgaac ttggccgtaa cctgccaacc aaagcgagaa caaaacataa catcaaacga 180 

atcgaccgat tgttaggtaa tcgtcacctc cacaaagagc gactcgctgt ataccgttgg 240 

catgctagct ttatctgttc gggcaatacg atgcccattg tacttgttga ctggtctgat 3 00 

atccgtgagc aaaaacggct tatggtattg cgagcttcag tcgcactaca cggtcgttct 360 

gttactcttt atgagaaagc gttcccgctt tcagagcaat gttcaaagaa agctcatgac 420 

caatttctag ccgaccttgc gagcattcta ccgagtaaca ccacaccgct cattgtcagt 4 80 

gatgctggct ttaaagtgcc atggtataaa tccgttgaga agctgggttg gtactggtta 540 

agtcgagtaa gaggaaaagt acaatatgca gacctaggag cggaaaactg gaaacctatc 600 

agcaacttac atgatatgtc atctagtcac tcaaagactt taggctataa gaggctgact 660 

aaaagcaatc caatctcatg ccaaattcta ttgtataaat ctcgctctaa aggccgaaaa 720 

aatcagcgct cgacacggac tcattgtcac cacccgtcac ctaaaatcta ctcagcgtcg 780 

gcaaaggagc catggattct agcaactaac ttacctgttg aaattcgaac acccaaacaa 840 

cttgttaata tctattcgaa gcgaatgcag attgaagaaa ccttccgaga cttgaaaagt 900 

cctgcctacg gactaggcct acgccatagc cgaacgagca gctcagagcg ttttgatatc 960 

atgctgctaa tcgccctgat gcttcaacta acatgttggc ttgcgggcgt tcatgctcag 1020 

aaacaaggtt gggacaagca cttccaggct aacacagtca gaaatcgaaa cctcaaaatt 1080 

tattcacaca tggttacgct ttggggaaat tatgagggga tctctcagac tcaagccttc 1140 

gcgaaagaaa ataaccaaaa agcatacaaa gaaacgtacg gcgtctctca tattacacgc 12 00 

catgatatgc tgcagatccc taaacagcag caaaacgaaa aataccaagt gcctcaattc 1260 

gatcaatcaa cgattaaaaa tattgagtct gcaaaaggac ttgatgtgtg ggacagctgg 1320 

ccgctgcaaa acgctgacgg aacagtagct gaatacaacg gctatcacgt tgtgtttgct 1380 

cttgcgggaa gcccgaaaga cgctgatgac acatcaatct acatgtttta tcaaaaggtc 1440 

ggcgacaact caatcgacag ctggaaaaac gcgggccgtg tctttaaaga cagcgataag 1500 

ttcgacgcca acgatccgat cctgaaagat cagacgcaag aatggtccgg ttctgcaacc 1560 

tttacatctg acggaaaaat ccgtttattc tacactgact attccggtaa acattacggc 1620 

aaacaaagcc tgacaacagc gcagaaagca taccggcttg aaattgtcag cttggaaatg 1680 

cagaaaaatg gagccgcaga tgcagcaccg taccgacaaa ttgagtactg ggctctgggg 1740 

catggtgacg acattaaaaa agctgttgcc ttctggtcct ctgggtggcc agtcggcttc 1800 

tccaaaatgg agaaggcagg gaaaatactc aggtcccaag tgaagtttcc agaatatatg i860 

gaagaatcct cctgtctagg tagagggtca ctgatgtctc tcaacaacac aagttctagc 192 0 

aatggcagtt ttatttttgt gttgcccctg aagttgctga gagttgggga cacctacaat 1980 

tcttccgatc agtcaagaat ggcctggcgg ctcaccatag agtttggggg ttctgagttg 2040 

catctggggg tgagggagga agcaggccac cagaaagggc tagtccatga gtccgggaat 2100 

cccgcccgaa gctctggcag tgacccgcaa catgccaggc accggcagcc ctcggctacc 2160 

agggctgctg ctgctgctgc tgctgctgct gctgctctgc ctgcgccgct gtcacttcca 2220 

gttcctactt ctgcgatcca ggtccgggtc actgcttacc cgctgctggc ccagtgcctc 22 80 

caggccgctt tccctccgct gctcggctcc ggctgcgggc aggaggggac cggcgcggga 2340 

ggcggcggcg gcgcggctgg agtgagggag cagctggagg acaggcgggc agcggccgag 2400 

ccaggtgacc tgcctggggg caaaagggtc aggggaaggg gagcgcgcga gggcccgggg 2460 

gtgggtgccg aaggaccccc tttggagagg aaccgccctt cttccccact gccgtggctc 252 0 

gcagccccag ctgcgggggc ctcacagttc gctgagatcc agggtgccgg gaagggggag 2580 

atgagggcca aggacgcaga gcgaggacgc gccaaactta ggggcgagtt gtccagctct 264 0 

ggccgcaaga tatttgatcc tgatgacctt tattcagggg tcaatttctc caaggtactg 2700 

agtactcttt tagctgtcaa caaagcaaca gaagatcagc tatcagaaag accatgtgga 2760 

cgttcctctt ctcttagtgc tgctaatact tctcagacaa acccacaggg agcagtttct 282 0 

agcacagttt cagggctgca aaggcagtca aagacagtgg agatgacgga aaatggaagt 2880 

catcagttga tagtaaaagc aagattcaac tttaagcaga ctaatgagga tgaactgtca 2940 

gtttgtaagg gggacatcat ttacgtcaca cgagttgaag aaggaggctg gtgggaaggc 3000 

acattaaatg ggagaacagg ctggttcccc agtaattatg tccgtgaaat taaatccagt 3060 

gagagacctc tctccccaaa agccgtcaaa ggatttgaaa ctgctccact taccaagaat 3120 

tattatactg tgatgtctag aagtcttaca tctactgtgc tgaaaaattc aaaggtagca 3180 

cggattcaca gcaagcccta ttaa 3204 



<210> 237 

<211> 1353 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> mis cofeature 
<222> (1) . . . (1353) 
<223> n = a,t,c or g 

<400> 237 

tcatgcagca gaagaggaag agtgtgcaag cagaaaatgg aagtcctgtg atactgagga 60 

gtcagaggta tctggagctg tcacagcagc tagaatatga ggagaaaaat tcttgcagta 120 

tcctgtgcca ttgtggtagg taacacacat tccttcattt acacagggtt catagccatc 180 

tcgacactgc aatgctgtaa acccgacttg acaggtgcac tcataggtat cccggctgag 240 

catatggcat gtgccgccat tcaggcaggg tcgagacaca aagcatggat gagatgttga 300 

gtactggcag tcctctcctg taaaccctga ggcacatcgg cacgtggctt tccccagcat 360 

ggcctgggcc acacaagtcc caccattctg gcagcggttc ttctcacagg ggtctcgatg 420 

ttgacaatat tcccccaaga agccttctgg acatcttatc agaatccctg ggtgctgagt 480 

ccaagatcca ccccaacaga ttttgatgta gccaggacac agagtttttt ctaccataat 540 

gggagaggta gggaggagga tcacatgcgc gggggccgcc cagcacagcc angacggcca 600 

gcagcgccca cagcagagcg ggacgcaggg ggggcatctt ctcggtcgcc tcctcctcct 660 

ccgccgccgc cgccgccgcc gccgccgcct gggcagatcc acatggggag ggggtcccga 720 

tagaggagcc ccactctctc ctcccctcct cctgcttcaa aggctcaggc cctggcgcta 780 

cgctccgaag cccaggcgca aatgcctcga ctccccgcgc cccgagtccg ccgctcctcg 840 

gccgccgcct cagccgccgc ccgaagtttg gctgaaactt tctcgggtaa ggagtgccaa 90 o 

tggacggatg cctgcctgtc tcatccctgt gcaaatggaa gtacctgtac cactgtggcc 960 

aaccagttct cctgcaaatg cctcacaggc ttcacagggc agaaatgtga gactgatgtc 1020 

aatgagtgtg acattccagg acactgccag catggtggca cctgcctcaa cctgcctggt 1080 

tcctaccagt gccagtgcct tcagggcttc acaggccagt actgtgacag cctgtatgtg 1140 

ccctgtgcac cctcgccttg tgtcaatgga ggcacctgtc ggcagactgg tgacttcact 1200 

tttgagtgca actgccttcc agaaacagtg agaagaggaa cagagctctg ggaaagagac 1260 

agggaagtct ggaatggaaa agaacccgat gagaattaga ccctggaaaa tatgtatgtg 1320 

tggttaataa agtgctttaa actgaattga aaa 1353 



<210> 238 
<211> 362 
<212> DNA 
<213> Homo sapiens 

<400> 238 

gaaaagagtt gttgactttc agggagccca aacctagggc ctccccaagc cagggttgtg 60 

agacctttga ggctctatgg ttcctagcat ttccaagctt ctgggcacaa ctgcattacc 120 

cggtacccat agtggaagcc acttgcagta tgcctggtcc agctgcagcc tcacacagag 180 

ctagcaccta tgttagcact tggagttgcc cacctcacca cagctggcat gcctggcagt 240 

gcacagtggc cagacctcac ttgcagacct ctcattgctg cacatctggc ttgcccttgg 300 

cagacatgga atccaggctg gttgcttcac caagtgagtg gaacaagctc acttgggccc 360 

a 3 362 



<210> 239 
<211> 1350 
<212> DNA 

<213> Homo sapiens 
<400> 239 

tcactgaaag agagagctac tcaaatccaa atccaaaacg cttccaccag cattctcttt 60 
tcccaataag ccacaggcag aacagccatg gcctacagaa agcagccaaa tgcctgcagc 120 
tgtgagcagg tgtaagatgg ctgtaagggc catccctcca tctccagagg cttccggagc 180 
cttggagccg tggatggcaa gcagacactg ataccggatg agctgtgtga tcatgggcac 240 
atcacgtctc tgaaccttgg tgttcttcct gtgagatggg ccattaacac caggcacctc 300 
ataggatgga tgtgaagact tccaggctgt tgacaccagg ctcagattac tttgtctatt 360 
aattcctggt aaacagtact ttctagtact gccaaagatg cttacagttc ccttcttgct 420 
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catatgaaca ccaaaatagg tgttcctgaa gggcagaggc tgttctccaa gcagcgtctg 480 

aagcaccagg ctagttccat cttctggctt caccatttcc aacacagagt tcccaaggtc 540 

agtaagttcg tctgcatgga ggcaatagca agagaagaag agagcgagaa gactcacttg 600 

ctccttaacc acttctgcct ggaagtgatg cgtatcactt ctgcttacac tccactggca 660 

aggacccatc acatggtccc atctactctt ctccatggag aaccagatgg cccaggtatg 720 

agtggttgag gggctggtgc cccgatgagg tgggggcaaa gatgacctct ctccagggtg 780 

atgtgaggaa atggagctcc ggaaggtaag aggcttgctc cccatcacac agctggcgtg 840 

cgtggccaag agcagggcct gcgtgtgttt tctgggctgt ctccaccgca gtgctgactc 900 

tcctcctcct acttgccact gataggcctg gacttgcaca atttcaatcfc caccaaaaat 960 

cacattcaat tcagtgaccc tgattatctt cagtgtgtgt gtgtgttttt ctataggctc 1020 

attacgatca ctaggtccca caataggctc aataggctct ctgcaagtgg tcacagtaaa 1080 

agcaagttga taaaggtggc cagtggggga acagagctta tgtgcctcct gagaaagcca 1140 

gtcgctggaa actggactga gagccctgat ttggagcacg aggaaacctg gtgggcctgt 1200 

gctgggtggc ctggtgttga taaaatgggc ttgggccagc cgctcaccag cttctccctc 1260 

agacccatcc ccaggcccca acctctgctg cagccctacc agccccgcca caaagcccag 1320 

ggtggacgga cccttcgtca tcagaaacat 1350 



<210> 240 

<211> 618 

<212> DNA 

<213> Homo sapiens 



<400> 240 

atgaattggg tgttacaaaa atttatcact gcatggaagt ttatgggcta cagaaagt cc 60 

tccaacagcg caagaggaag caeca taaaa gagcacattg agttggatgc acagagacct 120 

gtcaggagaa gtgggccaat acaagcatca ggcgctcatc ccaagaaggg aaggggtgtg 180 

agttgctcag tggaagagee aagtgaccag cagagtccgt caccaccctc ccctcttacc 240 

tttcagccca aagaeggaga aatcaacttt tcagtaattg gtcagtatgt ggattatctt 3 00 

gtgaaagaac agggagtgaa gaacattttt ggtaagtcaa ctctggggat gtegctgeat 360 

gtctccagct cagtctttag aaggtatatc ttacctggtt accagccaag aggtcacaca 420 

gtcatggtgt cacaggtaaa cattgacttc cagacccgag aagecacgag gaagaacctg 480 

caggagecat ccctgacttg ctttgaccaa geccaaggaa aagtacacag cctcatggag 540 

aaagactctt accccaggtt cctgaggtcc aaaatgtact tagatctget gtcccaaagc 600 

cagaggaggc tcagttag 618 



<210> 241 
<211> 669 
<212> DNA 
<213> Homo sapiens 



<400> 241 

atggtgatgt cattegtcaa gecaggagta aaggagaagg agcaggttaa gaagagagat 60 

ggtgagttca attctgaaca tgcagaattg gatgtgcctg ccagggacac caagaggaaa 120 

ttctgggagc ccaccaggct cagcagtacc ctgeggacat cgagcgaccc tttattttct 180 

gtccccatca gcattacgat ggtttgtgag ccaggctcca agtccctgca aagctgctgt 240 

ctgacagcag ggggcgccaa tgtctgggaa aaatctacct geegcaaaaa atccagacaa 300 

ttggtgctga ggaatgtaaa ggttccaggg aaaageccat gtggtgaact gctgcccatc 360 

ctcaaaaaaa accagttaaa catccttctc ttgcaaccag tggacacaga gactttggaa 420 

gggcctccag gccttggcct ggatgeagag ggecctgaga agagacacag ctggatcctc 480 

ctgccctgcc ctgggattga ccacacctct ggcttggagg ttatgtctga tctttatcac 540 

aggaagggaa actcgctcca tccccaagga aagagaacca aggatgecag aaaggaaagc 600 

ttcccccaga agatgggaca atttccctta caaagtctgg cagtcatcta ccccgaagca 660 

gggacctag 669 



<210> 242 
<211> 2043 
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<212> DNA 

<213> Homo sapiens 



<400> 242 

atggaagaac attccatgct catgggtagg aagaatcaat atcgtgaaaa tggccgtatt 60 

gcccaagaat tggaaaaaac tactttaaag ttcatatgga accaaaaaag agcctgcatc 120 

accaagtcaa acctaagcca aaagaacaaa gctggaggca tcacgctacc tgacttcaaa 180 

ctatactaca aggctacagt aaccaaaaca gcatggtact ggtaccaaaa cagagatata 240 

gatcaatgga acagaacaga gccctcagaa ataatgccgc atatctacaa ctatctgatc 300 

tttgacaaac ctgagaaaaa caagcaatgg ggaaaggatt ccctatttaa taaacggttc 360 

tgggaaaact ggctagccat atttagaaag ctgaaactgg atcccttcct tacaccttat 420 

acaaaaatta attcaagatg gattaaagac ttacatgtta gacctaaaac cataaaaacc 480 

ctagaagaaa acccaggcat taccattcag gacacaggca tgggcaagga cttcacgtct 540 

aaaacaccaa aagcaatggc aacaaaagcc aaaattgata aatgggatct aattaaacta 600 

aagagcttct gcacagcaaa agaaactacc atcagagtga acaggcaacc tacaaaatgg 660 

gagaaaattt tcgcaaccta ctcatctgac aaagggctaa catccagaat ctacaatgaa 720 

ctcaaacaaa ttcacaagaa aaaaacaaac aaccccatca gaaagtgggc gaaggatatg 780 

aacagacact tctcaaaaga agacatttat gcagccaaaa aacacatgaa aaaatgctca 840 

ccatcactgg ccatcagaga aatgcaaatc aaaaccacaa tgagatacca tctcacatca 900 

gttagaatgg caatcattca aaagtcagga aacaacagag ttttacccct tgcacctctg 960 

gctctggctg ccctctggat ggatcctgtg atgccaggca tggatggcct gctaggggac 102 0 

tcagagagct tccagggcct ttctgctact ttctttgcct ctgtatttca ctcagctctc 1080 

cacattgact cagctccagg gccctgcatc ggccctggag actcctctgc tgattcctct 1140 

ccaacatttc ttcctccaga agccaaaagg aaaaactacc tcctcctttg gaggaaaaac 1200 

ctaaaaaagt tctcagatga ccccaaaagg ctcatagagg gcttccccaa attggctctt 1260 

acatttagat taatttggaa ggacataaat gtactcctgg gacaagccct tttacaagag 1320 

gaaagacaga ccatctgcgg agcagcaatc cactgtcgaa atgacttaca cctggaaaat 1380 

gccaactacc ctggaggggc tacagctgtt ccccagctgg accccaatca ggactacaat 144 0 

gccaaagcgg gaatatgggc caggaaccat aggctcttgt gtctaataga gacaacgacg 1500 

caacaaccga ccaacgcaca cagtccacaa acacaacgac aacaacacga cacagacaaa 1560 

ccacaaccga acccaccagc caaaacaacc ggtgttcctg tatcttttct ggcttttcta 1620 

tatcagtatc tgtgtgggca catatccata agctggcccg tggtaattct taagtacgct 1680 

gcgagcgtct atggtatttc tctagcagac cgtaagcgac aatatgacag atactttcga 1740 

tatgaacgtt tacgcaccat aaagcctaat ttcttgccat tccaaatatt taagtctggg 1800 

agtgtggtaa agctaaaagc tggcttcaca atcggaaaag ttcacaatac agaagtaaca 1860 

gctttaaagg tttctgacac tcgaagagcg cagcatttgc agacaggatg ttggagtgca 1920 

gttgtgaccc acccaaacaa cttggaaaat gtagtacggc atccgccaga ggctctggct 1980 

gcttcttaca acaaaccctt catctgctcc ctggtcacac ttcaaggtgc atttgtgact 2040 
tag , 2043 



<210> 243 

<211> 1116 

<212> DNA 

<213> Homo sapiens 



<400> 243 

atgctcatgg tctatccccg gaccaacaag cagaatcaga agaagaaatg gaaagtggag 60 

ccccccacac cgcaggagcc tgggcctgcc aaggtggctg ttaccaccag cagcagcagc 120 

agcagcagca tccccagtgc tgagaaagtc cccaccacca agtccacact ctggcaggaa 180 

gaaatgagga ccaaggacca gccagatggc agcagtctga gtccagctca gagtcccagc 240 

cagagccagc ctcctgctgc cagctccctg cgggaacctg ggctggagag caaagaagag 300 

gagagcgcca tgagtagcga ccgcatggac tgtggccgca tccccagcac ccccaaccac 360 

aggaggtccc aggtgattga aaagtttgag gccttggaca ttgagaaggc agagcacatg 420 

gagaccaatg cagtggggcc ctcgccatcc agtgacacac gccagggccg cagcgagaaa 480 

agggcgttcc ctagcaagca ggacttcacc aatgaagccc ccccccaagc tcctctccca 540 

gacgcctcgg cttcccccgt ctccacaccg aagagccaag tcactggaca ggagggtcag 600 

aacatatcct gggatatggc ggtagtcctg aaagcaactc aggaggcacc tgctgcttca 660 

acccttggca gctactcatt accagggact ctggccaaga gtgagatact ggagactcat 720 

gggaccatga actttctaga cagcagtcct caggtccgtt accatccaag gaatcttggg 780 

acagcctgta accaggcggg actcgaccgc tactacccgg aaggcccact gcctaggagc 840 
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ggggacgaca cagtgctgtc accacggcca gcctggtcca agctagcagc tacgaggagt 900 

cagagactgg acccaagaca aaggggcagc actgaacgag agaaagggga ggcccacaac 960 

gaccccttgg ctgccgcgcc catctcagca gcgggcagcc gcagaggagc cctggcctcc 1020 

tggcttgcgt ctcccacacg ctcccagaac ccggcccgac cccccacgcc agactgtaga 1080 

gcacccctca ccgaaagcgc cagacccaca tcttga 1116 



<210> 244 
<211> 413 
<212> DMA 
<213> Homo sapiens 



<400> 244 

atggggaagt gggatggagg tbttaatttc atggtctttt gtgttttgga aaatacctgc 60 

agtttgaaca ggtctttgct ggaaaacctc atctttgcta cactoaagat cagctggggt 120 

ggtgcaggga ttaagggctg gaatcatatg aaggctcagt tgctcacatg tgtggtgttt 180 

gatgctggct ttcagttaag tccacagggt tctctagagc aaaagcacct catcctttac 240 

cctgaactca caggacctat agagttggac ccaatttgtt accaccacca tctttcaaat 300 

ctgctctcac aaacacccaa agcatccttt gaagttttac accagaatca ctgaaaagca 360 

tccagcagat tttctactgg cagctttcaa ggagcagctg tagctgaaaa gac 413 



<210> 245 
<211> 975 
<212> DNA 
<213> Homo sapiens 



<400> 245 

cccatatggg ccaccagatg ttacgggggg tggagggggt ctttgttctt agagctccca 60 

agatgcgacg ggccgctctc aagatggggg tgggctgctc ccaagatggc tgcaagcctt 12 0 

ttgttctctg accaggggtt cttggcctca tggattccaa ggaatggaac cttgggccat 180 

gcgatgagtg gtatagctct attagaagct gtgggtcatg gaagagaact gtggaaccca 240 

gcgactagtg ctgcttctgt tatctcaaat acatcatcag atgccaatca atttccaaag 300 

ttaccattag gagaaatcag tccagataaa gaaaaatctt actggtcaag taatccaact 360 

aaaaacaatc ccaaagagac aacacgcgtg gctgtttttg cgagaatttt ggatgttaat 420 

gacaatgccc cacactttgc tgtgttctat gacacttttg tatgtgaaaa tgccagacca 480 

gggcagaggg atgcctgtgg aaggcttggt gattgggact cagagaaaca ggaagggaga 540 

aaaggtttca cctccacgaa ttcagtcatt ctacataaaa aaagaacaaa aaggagtgtg 600 

ttgggagggg acaagttttg gcaaaaaaaa tctccagaaa gtccacaagg ggacttccag 660 

gacagtagga gcctcaggtc cacagtgaag tttcacaagt ccacaaggaa gttgcaaggg 720 

gaagtggaaa agagcagtgt tgccagaaat tatggatttg atcctcttta atgcctgtga 780 

agatacapaa aaacatgaaa tttcttttca atatttttgg gctgaaccac ttgggtggtg 840 

gctgaaatga gaccatacat acagcacaat gcatgtctgt tatattgcaa agtgtattac 900 

ctaggtctca tacatccccc tagtcatcag cattaatgca tgttgggacc ccgacccaca 960 

caagcagaag taatg 975 



<210> 246 
<211> 349 
<212> DNA 
<213> Homo sapiens 



<400> 246 

gcccacgcaa tcattgattt cccttggtgg cccattgtcc cagggctgtc tggagcatga 60 

ggccagcagt gcctacgaat ggctgtggag tctttgcgcg ctgcttgaca tgtacactgc 120 

cgggccgacc aagacgcaga cgctccagcc catgggccaa cctaacctga aaggagatgg 180 

aggcttcacc agagaatcaa ccggtttcat gcaacttcct gctgacttca tctccagcct 240 

catttgtcat gaaacatggg ttccaggaaa gcccagtact gctatgcaca gaggccgata 300 
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ctgggcagag ccgattatgc ttccaaaggg tagcggcttt cggaaggag 349 



<210> 247 

<211> 431 

<212> DNA 

<213> Homo sapiens 



<400> 247 

caatgtcgga gaaaaatacc cctcttgttt tgagtggaga aaatcagaag aaagggagag 60 

agatcggggt gtgccgaaag caaagccaat gtgatcatca agataataat agccatacat 12 0 

taagattcag ctcctactcc tcctcctctg gtccagtcac tctggtctcc ttccattccc 180 

acaactatcc aagcaaagtc ctgcttcagg gaaatttgga cacagagaca tgtacagaga 24 0 

gaagacaaag agaaatttgg acacagagac atgaagggaa atgtggacac agagacatgt 300 

acagagagaa gacgaagaga aaatacagag agaaggccat ctacaagcta agaaaaggac 360 

ctgaaacaga tccttcctca cagccctcgg aaagaaccaa ccctgccaac accttgatct 420 

cagactccta g 431 



<210> 248 

<211> 1272 

<212> DNA 

<213> Homo sapiens 

<400> 248 

atgaacaata agctggagac agggcagggg cggcgaggcg gggaggcgtt tatggcatac 60 

gccaccgcag gtgaagcatc tggaacctgg gcatccaggc aacaggctcg caaggcccag 120 

attcaggggc agccacctgc gccaggctgg gacctgacac agggtgagca ggctgggaaa 180 

ttcaccagca ctgcggacag agcaggggat aacagcagca tcagaaagac ccataaggcc 240 

ataggtttgc agagggaatg cagtccctca gcaaggagct tcagtcggac taaaccagca 300 

gaaaaggcgg tttccgagta ccatgctcgc acaacaggga gccgccgacc aggcctggag 360 

atccctgtga cagtcacggc aggactgggg ctgcaggagt gcagagccag gtacaggcct 420 

gggaagccct catctcaccg cgaggacagt cgcgtgcgac aggcctgtaa ggtggcatcc 480 

gagtccctcc cgcaattaag gacgccgggc tccaggccag cgccagggac agacccagca 540 

ccagggcggc ccccggagcc ggcctctggc ggcctcgggt cttttgccaa attccccacc 600 

ggcgccagga taccgagggg ccacccccac cacgcggctc caacagacac gtttcccaca 66 0 

ctggctgcgg aggcccgccg gacggggggc cccgtgctga ccgcagccgc ggagcccgca 720 

gcgcacatcc cggagcacag gacaaaacct gtgcctgcac cggagccccc atccggctcc 780 

cgcaacactg acccccctgg gcagcctcgg gcacggggca cctggaaagc cagccccgga 840 

caccgcgcgg attctgcctc ccggagagct tccttcctgt tccgatgttt ggcgaacctg 900 

cagcgctccc tgaagcagat gagagggaag ctgcactccc agaaagcgca gttttggttc 960 

atattgaatg gatttattgg gggtgtcatc ggcaggcgga tgacagattg tcaggcctgt 1020 

gaacctaggc taagaagcat ccagtgtcaa ctacctgaat cttacaccag cctttgccac 1080 

ccagcagcac tgacccaaag tggacccaag aatgtccttg aaagagacca accatctgcc 1140 

tgcagcctca agacacctgc tcagacctgc ctgcctcagt gctccctaca ctggacatta 1200 

agagacgatc agacgcagcc actcacagcc ccgagcagta caatgaatgg agcctatcgc 1260 

atgaaatgct ga 1272 



<210> 249 

<211> 380 

<212> DNA 

<213> Homo sapiens 



<400> 249 

gctccttctc cctggcctca atcatcttgg ccaatttcca agacagtaca gcacttgcta 60 

ggaacagtgg agtaagggcc aaaataacgg ttgtaaggaa gccatagggg tcctttgcag 120 

cccattccac aacatactca gcccaagcct ttaaaactgg ggcttatgct tgagtagtga 180 
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tcccttaact ttgtcctggt gttattagtg acccaatcac gacttcagat gacccttggt 240 

agtctgtctt cttgagttct actgtacaca tctgagagac ggtttaaatt ctgtacaatt 300 

ggcctatcgg ggctgcagac ccacggaggc cacgttcact cctgcccgtc ggccttggca 360 

agcgcgggcc ccttgtcgcg 380 



<210> 250 
<211> 560 
<212> DNA 
<213> Homo sapiens 



<400> 250 

ctacaaatag agggatctgt ttccgtcact cgcgccctaa cgcaaagggt gggacaaccc 60 

gcgcgcatgc gtcgttgtcc tctgctctcc agggtaaacc ccgccccttt ctttaggcca 120 

gttttaccac agcacatgcg cagtacagga tctgtctgtt cgtttgtcgg cgctaccaat 180 

aaagttttag tgagcacaga ctcccttttc tttggcaaga tggcggagta cgacttgact 240 

actcgcatcg cgcacttttt ggatcggcat ctagtctttc cgcttcttga atttctctct 300 

gtaaaggagg tgagggggtc tttgggcgca aggaaggcgt gggggcggat gaggtgctga 36 0 

gcaaggccga aaggtggtgt ctgtagtcct aggcctgaca cgtgcgaggc ggtcgcgagt 42 0 

ttggttcctg gacggacact aaggatcttc tggtagttta gcgtcgtctc agttaagaag 480 

caccgcaaaa tgcagttagc tctctgttct gcatctgtgg attcagccaa ctgcagattg 540 

aaaatattca gggaaagaaa 560 



<210> 251 

<211> 1092 

<212> DNA 

<213> Homo sapiens 

c220> 

<221> mis cofeature 
<222> (1) . . . (1092) 
<223> n = a,t,c or g 



<400> 251 

cagtacgacg tttaaacacg gccatgcaag caaaattacc ccactaaagg aataagttcg 60 

gcccatcttt tttttttttt ttttgctgct ggaacgtttt attaagttaa gaggttcagg 120 

gagcagaaga gaaacacctt ttgggtggct tctggcggtt ctgcacccaa gcacagcctc 180 

gaaggaggct gtggggccat ggaggcccag ctgctagctc cctctgtcct gagccccatg 240 

gtacgggtcc aaatggggca ggaagtgtta gtaggaggta gggcaggaag agggtgngca 300 

ggctcccgcc tctcccctgt agagacaccg ccgccatggc ttgctgcctg ggcctccgcc 360 

agacctctgg gccagggcca ccagctcaga agcccaagca cagcggtggc tggagacaga 420 

ggg9gaccag ggttggaggc acacagccac caggaattgc tttttttttt ttttttttta 480 

aagtataaag tgttttggaa aaaaaggaaa aaaatctata taaaaatctc ttcacatata 540 

aaatcctgaa gaaggtgcaa ggtgagaccc cagntgcgag gggcgcgcat cagatatgca 600 

gtgtgtgtgt gtgtgtgtgt gtgtgtgtat ccgtgtgtac atgtgtgcac gtgtgtgcgt 660 

atgtgtctgt gtgtctgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgg tgggtgcaag 720 

tgcacgtgtg gcccaacaga gggtggggag aaagcttggc tttttacttc catccaggag 780 

ggaaggaggt gcggctggtc ctccagcctg agagggtctg cagctgggcg ggacctctac 840 

tcagccaggc tgtttgcgca tcgactcctt ctcctggagg gccggccatg gcaagacgca 900 

ggtgctcctt cagctgctcg aatctcccgc tcagagccgt ggtcttgatg gtggctcagc 960 

tccacataga acgtcctggt actttcccga ggtgaagcgc ttgtccttct gcatcatctg 102 0 

gagctcgtcc cggaggcacc tgccctactt tcttaagtac tggagttccg tttcttttaa 1080 

ccgaaaccac ac 1092 



<210> 252 
<211> 246 
<212> PRT 
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<213> Homo sapiens 



<400> 252 



Arg 


Gin 


Ser 


Ser 


Gly 


Asn 


Leu 


Thr 


Met 


Ala 


Trp 


Thr 


Pro 


Leu 


Leu 


Leu 


1 








5 










10 










15 




Pro 


Leu 


Leu 


Thr 


Phe 


Cys 


Thr 


Val 


Ser 


Glu 


Ala 


Ser 


Tyr 


Glu 


Leu 


Thr 








20 










25 










30 






Gin 


Pro 


Pro 


Ser 


Val 


Ser 


Val 


Ser 


Pro 


Gly 


Gin 


Thr 


Ala 


Thr 


He 


Thr 






35 










40 










45 








Cys 


Ser 


Gly 


Asp 


Ala 


Leu 


Pro 


Lys 


Lys 


His 


Pro 


Tyr 


Trp 


Tyr 


Gin 


Gin 




50 










55 










60 










Lys 


Ser 


Gly 


Gin 


Ala 


Pro 


Val 


Leu 


Val 


He 


Tyr 


Glu 


Asp 


Asn 


Lys 


Arg 


65 










70 










75 










80 


Pro 


Ser 


Gly 


He 


Pro 


Glu 


Arg 


Phe 


Ser 


Ala 


Ser 


Ser 


Ser 


Gly 


Thr 


Met 










85 










90 










95 




Ala 


Thr 


Leu 


Thr 


He 


Ser 


Gly 


Ala 


Gin 


Val 


Glu 


Asp 


Glu 


Ala 


Asp 


Tvr 








100 










105 










110 






Tyr 


Cys 


Tyr 


Ser 


Thr 


Asp 


Ser 


Ser 


Gly 


Asn 


His 


Arg 


Gly 


Val 


Phe 


Glv 






115 










120 










125 








Qly 


Gly 


Thr 


Arg 


Leu 


Thr 


Val 


Leu 


Ser 


Gin 


Pro 


Lvs 


Ala 


Ala 


Pro 


Ser 




130 










135 










140 










Val 


Thr 


Leu 


Phe 


Pro 


Pro 


Ser 


Ser 


Glu 


Glu 


Leu 


Gin 


Ala 


Asn 


Lys 


Ala 


145 










150 










155 








160 


Thr 


Leu 


Val 


Cys 


Leu 


He 


Ser 


Asp 


Phe 


Tyr 


Pro 


Gly 


Ala 


val 


Thr 


Val 










165 










170 










175 




Ala 


Trp 


Lys 


Ala 


Asp 


Ser 


Ser 


Pro 


Val 


Lys 


Ala 


Gly 


Val 


Glu 


Thr 


Thr 








180 










185 










190 






Thr 


Pro 


Gly 


Lys 


Gin 


Ser 


Asn 


Asn 


Lvs 


Tvr 


Ala 


Ala 


Ser 


Ser 


Tv:r 


Leu 






195 










200 










205 








Ser 


Leu 


Thr 


Pro 


Glu 


Gin 


Trp 


Lvs 


Ser 


His 


Lvs 


Ser 


Tvr 


Ser 


Cvs 


Gin 




210 










215 










220 










Val 


Thr 


His 


Glu 


Gly 


Ser 


Thr 


Val 


Glu 


Glu 


Thr 


Gly 


Ala 


Pro 


Thr 


Glu 


225 










230 










235 










240 


Tyr 


Leu 


Leu 


Arg 


Val 


Tyr 






























245 


























<210> 253 




























<211> 539 




























<212> PRT 




























<213> Homo 


sapiens 
























<400> 253 


























Met 


Glu 


Lys 


Gly 


Ser 


Gly 


Phe 


He 


Lys 


Tyr 


Ser 


Thr 


Tyr 


Lys 


Gin 


Gly 


1 








5 










10 










15 




Thr 


He 


Arg 


Val 


Ala 


Glu 


Glu 


Ala 


Glu 


Thr 


Ala 


His 


Ser 


Ser 


Val 


Leu 








20 










25 










30 






He 


Gly 


Pro 


Glu 


Lys 


Gly 


Val 


Val 


His 


Leu 


Ala 


Thr 


Ala 


Ala 


Val 


Leu 






35 










40 










45 








Asn 


Ala 


Val 


Trp 


Asp 


Leu 


Trp 


Ala 


Lys 


Gin 


Glu 


Gly 


Lys 


Val 


Leu 


Ala 




50 










55 










60 










Val 


Gly 


Arg 


Glu 


Leu 


Gin 


Glu 


Glu 


Glu 


Lys 


Glu 


Glu 


Thr 


Gly 


Trp 


Arg 


65 










70 










75 










80 


Lys 


Ala 


Gin 


Ala 


Ala 


Val 


Glu 


Gly 


Gly 


Val' 


Gly 


Thr 


Trp 


Trp 


Leu 


Thr 










85 










90 










95 




Ala 


Ser 


He 


Arg 


Ala 


Ala 


Asn 


Ala 


Phe 


Thr 


Val 


Arg 


Lys 


Lys 


Trp 


Gly 








100 










105 










110 






Leu 


Tyr 


Thr 


Tyr 


Val 


Leu 


Gin 


He 


Leu 


Ser 


Phe 


Leu 


Leu 


Gin 


Ala 


Cys 






115 










120 










125 








Leu 


Glu 


Val 


Thr 


Cys 


Gly 


His 


Asp 


Leu 


He 


Met 


Gly 


Cys 


He 


Lys 


Ser 




130 










135 










140 
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Lys 


Glu 


Asn 


Lys Ser 


Pro Ala 


He 


Lys 


145 








150 






Glu 


Pro 


Val 


Ser Thr 


Ser val 


Ser 


His 








165 








Val 


Ser 


Pro 


Cys Pro 


Ser Ser 


Ser 


Ala 








180 






185 


Ser 


Ser 


Leu 


Ser Met 


Thr Pro 


Phe 


Gly 






195 






200 




Phe 


Gly 


Gly 


Ala Ser 


Ser Ser 


Phe 


. Ser 




210 






215 






Ala 


Gly 


Leu 


Thr Gly 


Gly Val 


Thr 


He 


225 








230 






Glu 


Ala 


Arg 


Thr Thr 


Glu Asp 


Leu 


Ser 








245 








Gin 


He 


He 


Asn Asn 


Thr Pro 


Met 


Val 








260 






265 


Pro 


Gly 


Asp 


He Trp 


Gin Tyr 


Leu 


Glu 






275 






280 




Gly 


Val 


Leu 


Pro Leu 


Ser Ser 


Ser 


Ala 




290 






295 






Val 


Thr 


Asn 


Gly Val 


Thr Pro 


Thr 


He 


305 








310 






Val 


Glu 


Asn 


Arg Leu 


Leu Thr 


Ser 


Asn 








325 








He 


Asn 


His 


Ser Asn 


Ser Ser 


Lys 


Thr 








340 






345 


Val 


Val 


Ser 


Arg Gly 


Asn Ser 


Ser 


Glu 






355 






360 




Gin 


Asp 


Gly 


Lys Thr 


Asp His 


Ala 


He 




370 






375 






Thr 


Ala 


Leu 


Gly Arg 


Glu Ala 


Leu 


Glu 


385 








390 






Arg 


Lys 


Val 


Gly Gin 


Arg Ser 


Arg 


Trp 








405 








He 


Ser 


Gly 


Pro He 


Gin Ala 


Glu 


Thr 








420 






425 


Gin 


Trp 


Thr 


Gin Glu 


Phe His 


Thr 


Ser 






435 






440 




Leu 


Pro 


Asp 


He Tyr 


Thr Leu 


Asp 


Cys 




450 






455 






Tyr 


He 


His 


Ser Phe 


Gly Asp 


He 


Thr 


465 








470 






His 


Leu 


Gin 


He Asp 


Asp Tyr 


Gin 


Leu 








485 








Gly 


Trp 


Thr 


Arg Pro 


Pro Pro 


Phe 


His 








500 






505 


Leu 


Leu 


Thr 


Leu Ala 


Cys Ala 


Glu 


Thr 






515 






520 




Asp 


Ala 


Leu 


Ala Lys 


Ser Arg 


Phe 


Lys 




530 






535 







<210> 254 
<211> 236 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> (1) . . . (236) 

<223> Xaa as x or * as defined 
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Tyr 


Arg 


Pro 


Glu 


Asn 


Thr 


Pro 




155 










160 


Tyr 


Gly 


Ala 


Glu 


Pro 


Thr 


Thr 


170 










175 




Lys 


Gly 


Thr 


Ala 


Val 


Asn 


Phe 










190 






Gly 


Ser 


Ser 


Gly 


Val 


Thr 


Pro 








205 








Val 


Val 


Pro 


Ser 


Ser 


Tyr 


Pro 






220 










Phe 


Val 


Ala 


Leu 


Tyr 


Asp 


Tyr 




235 










240 


Phe 


Lys 


Lys 


Gly 


Glu 


Arg 


Phe 


250 










255 




Leu 


Asn 


Leu 


Gly 


Gin 


Asn 


His 










270 






Thr 


Phe 


Leu 


Val 


Val 


Thr 


Val 








285 








Ser 


Thr 


Pro 


Val 


Phe 


Asp 

XT 


Ara 






300 










Lys 


Asp 


Leu 


Thr 


Gly 


Cys 


Cvs 




315 










320 


Ser 


Ser 


Asp 


Phe 


Phe 


Thr 


Leu 


330 










335 




Pro 


Phe 


Gin 


Asn 


Thr 


Ara 


Leu 










350 






Lvs 


Gin 


Phe 


Ala 


He 


Ara 


Phe 








365 








Gin 


Leu 


Ser 


Ser 


Glv 


Lys 


Lys 






380 










His 


Pro 


Glu 


Ser 


Leu 


Asp 


Ser 




395 










400 


Ser 


Ser 


Gin 


Ala 


Ala 


Ser 


Pro 


410 










415 




Ala 


Leu 


Leu 


Cys 


Pro 


Gly 


Asp 










430 






Pro 


Leu 


Leu 


Thr 


Val 


Pro 


His 








445 








Cys 


Arg 


Lys 


Asp 

XT 


Phe 


Ser 


He 






460 










Gin 


Ser 


Tyr 


He 


Phe 


Lys 


Tyr 




475 










480 


Cys 


Ala 


Gin 


Ala 


Leu 


Lys 


Asp 


490 










495 




Thr 


Ala 


His 


Leu 


His 


Phe 


Ser 










510 






Val 


Thr 


Ser 


Ala 


Glu 


Thr 


Pro 



525 

Val Lys 
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<400> 254 



Gly 


Thr 


Arg 


Asp 


Ala 


Thr 


Ala 


Glu 


Glu 


Asn 


Arg 


Val 


Leu 


Leu 


Ala Met 


1 








5 










10 










15 


Val 


Asn 


Pro 


Thr 


Val 


Phe 


Phe 


Asp 


He 


Ala 


Val 


Asp 


Gly 


Glu 


Pro Leu 








20 










25 










30 




Gly 


Arg 


Val 


Ser 


Phe 


Glu 


Val 


Arg 


Gly 


Leu 


Asp 


Thr 


Lys 


Lys 


Xaa Leu 






35 










40 










45 






Leu 


lie 


Xaa 


Ser 


He 


Lys 


Leu 


Cys 


Xaa 


Gin 


He 


Gly 


Gly 


Ser 


Ser He 




50 










55 










60 








Phe 


lie 


Thr 


Ser 


Asp 


Xaa 


Lys 


Asn 


Ser 


Cys 


Leu 


Pro 


Leu 


He 


Val Gin 


65 










70 










75 








80 


Gin 


Cys 


Leu 


Leu 


Phe 


Leu 


Arg 


He 


Leu 


Pro 


Leu 


Phe 


Ala 


Asp 


Lys Val 










85 










90 










95 


Pro 


Lys 


Thr 


Ala 


Glu 


Asn 


Phe 


Arg 


Ala 


Leu 


Ser 


Thr 


Gly 


Glu 


Lys Gly 








100 










105 










110 




Phe 


Gly 


Leu 


Xaa 


Gly 


Val 


Pro 


Cys 


Phe 


His 


Arg 


He 


He 


Pro 


Gly Phe 






115 










120 










125 






Met 


Cys 


Gin 


Gly Gly 


Asp 


Cys 


Glu 


Arg 


His 


His 


Asn 


Gly 


Thr 


Gly Gly 




130 










135 










140 








Lys 


Ser 


He 


Tyr 


Thr 


Glu 


Lys 


Phe 


Glu 


Asp 


Glu 


Asn 


Phe 


He 


Leu Lys 


145 










150 










155 








160 


Ala 


Tyr 


Gly 


Val 


Leu 


Gly 


Ser 


Leu 


Ser 


Met 


Ala 


Asn 


Ala 


Gly 


Pro Asn 










165 










170 










175 


Thr 


Asn 


Gly 


Ser 


Gin 


Phe 


Phe 


He 


Cys 


Thr 


Ala 


Lys 


Thr 


Glu 


Trp Leu 








180 










185 










190 




Asp 


Gly 


Lys 


Pro 


Val 


Val 


Phe 


Gly 


Lys 


Val 


Lys 


Glu 


Gly 


Met 


Asn He 






195 










200 










205 






Val 


Glu 


Ala 


Met 


Glu 


Arcr 


Phe 


Glv 


Ser 


Arcr 


Asn 


Glv 


Lys 


Thr 


Ser Lvs 




210 










215 










220 








Lys 


He 


He 


Ser 


He 


Ala 


Asp 


Cys 


Gly 


Gin 


Leu 


Glu 








225 










230 










235 












<210> 255 


























<211> 105 


























<212> PRT 


























<213> Homo 


sapiens 






















<220> 




























<221> misc 


feature 






















<222> i 


(1) 


(105) 






















<223> Xaa = X or * 


as defined in Table 


6 










<400> 255 
























lie 


Cys 


His 


Gin 


Pro 


Thr 


Ser 


Leu 


Ser His 


Gin 


Xaa 


Glu 


Arg 


Asn Gly 


1 








5 










10 










15 


Trp 


Pro 


Gly 


Arg Trp His 


Ser Gin Pro Arg 


Ser 


Lys 


Phe 


Cys 


Arg Leu 








20 










25 










30 




Arg 


Val 


Leu 


Ser Gin Gly 


Asp His Glu Lys 


Lys 


Ser 


Leu 


Pro 


Ser Gly 






35 










40 










45 






Gin 


Arg 


Pro 


His 


Arg Thr 


Gly Cys Ala Ser 


Ser 


Ser 


Gly 


Ser 


Ser Lys 




50 










55 










60 








Gly 


Leu 


Leu 


Leu 


Leu 


Pro 


Leu Asp Gly Leu 


Gly 


Val 


He 


Val 


Leu He 


65 










70 










75 








80 


Asn 


Pro 


His 


Leu 


Val 


Val 


Phe Pro Gly Met 


Arg 


Ala 


Pro 


Xaa 


Leu Leu 










85 










90 










95 


Pro 


Arg 


Leu 


Trp Leu Arg 


Arg 


Trp 


Ser 




















100 










105 















<210> 256 
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<211> 128 
<212> PRT 
<213> Homo sapiens 

<220> 

<22l> misc_feature 
<222> (1) . . . (128) 

<223> Xaa = X or * as defined in Table 6 



<400> 256 



val 


Arg 


Asp 


Tyr Asn Leu Thr Glu 


l 






5 


Tyr 


Pro 


Pro 


Val Asn Arg Lys Tyr 








20 


Val 


Gin 


Trp 


lie Val Leu His Arg 






35 


40 


His 


Ala 


Trp 


Gly Asp Thr Leu Glu 




50 




55 


Ala 


Met 


Phe 


Gly Tyr Met Thr Asp 


65 






70 


Thr 


Val 


Glu 


Val Glu Thr Gin Gly 








85 


Lys 


His 


Pro 


Gin Gly Thr Glu Val 








100 


Gin 


Val 


Tyr 


Asn Glu Glu Asn Pro 






115 


120 



Glu Gin Lys Ala He Lys Ala Lys 

10 15 
Glu Tyr Leu Asp His Thr Ala Asp 

25 30 
Ala Xaa He Tyr Phe Phe Arg Leu 
45 

Glu Ala Phe Glu Gin Cys Ala Met 
60 

Thr Gly Thr Val Glu Pro Leu Gin 
75 80 
Trp Gly Glu Glu Phe Ser Leu Ser 

90 95 
Lys Ala He Thr Tyr Ser Ala Met 
105 110 
Glu Val Phe Val He He Asp He 
125 



<210> 257 
<211> 111 
<212> PRT 
<213> Homo sapiens 



<400> 257 



Val 


Thr 


Ser 


Ser Cys Pro Arg Lys 


1 






5 


Pro 


Ser 


Ser 


Ser Phe Ser Pro Pro 








20 


Ala 


Pro 


Arg 


Glu Gly Arg Arg Gly 






35 


40 


Pro 


Ser 


Gin 


Pro Leu Asp Ser His 




50 




55 


Leu 


Pro 


Pro 


Ser Pro Arg His Pro 


65 






70 


Arg 


Ser 


Gly 


Pro Arg Arg Lys Arg 








85 


Cys 


Lys 


Glu 


Glu Glu Gly Glu Lys 



100 



Lys Arg Arg Phe Gly Gly Asp Arg 

10 15 
Ser Lys Glu Leu Leu Ala Val Lys 

25 30 
Pro Gly Asn Glu Ser Arg Ser Glu 
45 

Gly Pro Gly Leu Arg Arg Thr Phe 
60 

Thr Lys Asp Arg Arg Thr Ala Ala 
75 80 
Gly Gin Thr Asn Glu He Arg Gly 

90 95 
Tyr Leu Val Pro Ala Gin Gly 
105 110 



<210> 258 
<211> 224 
<212> PRT 
<213> Homo sapiens 



<400> 258 

Phe Tyr Phe Val Pro. Ser Gin Glu Ser Val Pro Ser Ala Ser Pro Thr 
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1 








5 


10 






15 


Gly 


lie 


Pro 


Lys 


His Ser Leu Arg Lys 


Thr 


Thr 


Ser 


Thr Glu Glu Pro 








20 


25 








30 


Arg 


Gly 


Thr 


His 


Ser Gin Gly Gin Phe 


Thr 


Met 


Pro 


Leu Ala Gly Met 






35 




40 








45 


Ser 


Leu 


Gly 


Ser 


Leu Lys Ser Glu Phe 


Val 


Pro 


Leu 


Phe Ser Ala Thr 




50 






55 






60 




Pro 


Phe 


Trp 


Val 


Pro Phe Ser Ser Leu 


Pro 


Leu 


Phe 


Pro Trp Val Leu 


65 








70 




75 




80 


Val 


Glu 


Asp 


His 


Val Cys Leu Leu Asp 


Cys Val Val 


Val Asp Leu Gin 










85 


90 






95 


Asp 


Met 


Asp 


He 


Phe Ala Ala Glu Arg 


His 


Pro Arg 


Asp Tyr Ser Lys 








100 


105 








110 


Ala 


Pro 


Glu 


Asp 


Ser Ser Gly Asp Leu 


He 


Phe 


Pro 


Ser Tyr Phe Val 






115 




120 








125 


Arg 


Gin 


Thr 


Gly 


Gly Ser Leu Leu Thr 


Glu 


Pro 


Cys 


Arg Leu Lys Leu 




130 






135 






140 




Gin 


Val 


Glu 


Arg 


Asn Leu Asp Lys Glu 


He 


Ser 


His 


Thr Val Pro Asp 


145 








150 




155 




160 


lie 


Ser 


He 


His 


Gly Asn Leu Ser Ser 


Val 


His 


Cys 


Ser Leu Asp Leu 










165 


170 






175 


Tyr 


Lys 


Tyr 


Lys 


Leu He Arg Gly Leu 


Leu 


Glu 


Asn 
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Gly 


Arg 


Arg 


Phe 


Ser 








980 










Gin 


Ala 


Val 


His 


Thr 


Gly 


Ser 


Arg 






995 








1000 


Arg 


Ser 


Ser 


Phe 


Ser 


Ser 


Lys Thr 


1010 








1015 




His 


Thr 


Gly 


Ser 


Arg 


Pro 


Phe 


Ser 


L025 








1030 






Ser Arg 


Lys 


Thr 


His 


Leu Val Arg 








1045 








Ala 


His 


Ala 


Ala 


Pro Asp Ala Ala 






1060 










Pro 


Pro 


Glu 


Val 


Ala 


Pro 


Pro 


Pro 



1075 1080 



585 










590 






His 


Leu 


Ser 


Leu 


Leu 


Pro 


Ser 


Pro 










605 








Leu 


Ala 


Ala 


Ala 


Tyr 


Lys 


Leu 


Glu 








620 










Gly 


Leu 


Ala 


Leu 


Ser 


Gly 


Trp 


Gly 






635 










640 


Cys 


Gly 


Glu 


Cys 


Glu 


Arg 


Arg 


Phe 




650 










655 




His 


Gin 


Arg 


Leu 


His 


Arg 


Gly 


Glu 


665 










670 






Gly 


Arg 


Ser 


Phe 


Thr 


Gin 


Arg 


Ala 










685 








His 


Arg 


Gly 


Glu 


Arg 


Pro 


Phe 


Pro 








700 










Ser 


Lys 


Lys 


Ala 


His 


Leu 


Thr 


Arg 






715 










720 


Arg 


Pro 


Tyr 


Pro 


Cys 


Ala 


Glu 


Cys 




730 










735 




His 


Leu 


Gly 


Ser 


His 


Gin 


Lys 


Thr 


745 










750 






Cys 


Thr 


Glu 


Cys 


Glu 


Lys 


Arcr 


Phe 










765 








His 


Gin 


Arg 


He 


His 


Thr 


Gly 


Glu 








780 










Ala 


Arg 


Ser 


Phe 


Thr 


His 


Lys 


Gin 






795 










800 


His 


Gin 


Thr 


Ala 


Gly 


Pro 


Ala 


Arcr 




810 










815 




Pro 


His 


Ser 


Thr 


Ala 


Pro 


Ser 


Pro 


825 










830 






Pro 


Phe 


Ala 


Cys 


Ser 


Asp 


Cys 


Gly 










845 








Leu 


Ala 


Thr 


His 


Gin 


Cys 


Leu 


His 








860 










Cys 


Asp 


Glu 


Cys 


Ala 


Leu 


Gly 


Ala 






875 










880 


Pro 


Leu 


Ala 


Ser 


Ala 


Pro 


Gly 


Gly 




890 










895 




Pro 


Val 


Val 


Pro 


Gin 


Arcr 


Ala 


Pro 


905 










910 






Pro 


Asp 


Cys 


Gly 


Arg 


Gly 


Phe 


Ser 










925 








Pro 


Arg 


Val 


His 


Thr 


Gly 


Glu 


Arg 








940 










Arg 


Arg 


Phe 


Gly 


Ser 


Arg 


Pro 


Asn 






955 










960 


Ser 


Gly 


Ala 


Arg 


Pro 


Phe 


Ala 


Cys 




970 










975 




Arg 


Lys 


Ser 


His 


Leu 


Gly 


Arg 


His 


985 










990 






Pro 


His 


Ala 


Cys 


Ala 


Val 


Cys 


Ala 








1005 








Asn 


Leu 


Val 


Arg His 


Gin 


Gly 


lie 






1020 










Cys 


Pro 


Gin 


Cys Gly 


Lys 


Ser 


Phe 




1035 








1040 


His 


Gin 


Leu 


He 


His 


Gly 


Glu 


Ala 


1050 








1055 




Leu 


Ala 


Ala 


Pro 


Ala 


Trp 


Ser 


Ala 



1070 



Leu Phe Phe 
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<210> 262 
<211> 738 
<212> PRT 

<213> Homo sapiens 
<220> 

<22l> misc_feature 
<222> (1) . . . (738) 

<223> Xaa = X or * as defined in Table 6 



<400> 262 



lie 


Thr 


Met 


Gly 


Ser 


Ser 


Gly 


Leu 


Gly 


Lys 


Ala 


Ala 


Thr Leu Asp Glu 


1 








5 










10 






15 


Leu 


Leu 


Cys 


Thr 


Cys 


He 


Glu 


Met 


Phe 


Asp 


Asp 


Asn 


Gly Glu Leu Asp 








20 










25 








30 


Asn 


Ser 


Tyr 


Leu 


Pro 


Arq 


He 


Val 


Leu 


Leu 


Met 


His 


Arg Trp Tyr Leu 






35 










40 










45 


Ser 


Ser 


Thr 


Glu 


Leu 


Ala 


Glu 


Lvs 


Leu 


Leu 


Cvs 


Met 


Tvr Arcr Asn Ala 




50 










55 










60 




Thr 


Gly 


Glu 


Ser 


Cvs 


Asn 


Glu 


Phe 


Arq 


Leu 


LVS 


He 


Cys Tyr Phe Met 


65 










70 










75 




80 


Arg 


Tyr 


Trp 


He 


Leu 


Lys 


Phe 


Pro 


Ala 


Glu 


Phe 


Asn 


Leu Asp Leu Gly 










85 










90 






95 


Leu 


He 


Arg 


Met 


Thr 


Glu 


Glu 


Phe 


Arg 


Glu 


Val 


Ala 


Ser Gin Leu Gly 








100 










105 








110 


Tyr 


Glu 


Lys 


His 


Val 


Ser 


Leu 


He 


Asp 


He 


Ser 


Ser 


He Pro Ser Tyr 






115 










120 










125 


Asp 


Trp 


Met 


Arg 


Arg 


Val 


Thr 


Gin 


Arg 


Lys 


Lys 


Val 


Ser Lys Lys Gly 




130 










135 










140 




Lys 


Ala 


Cys 


Leu 


Leu 


Phe 


Asp 


His 


Leu 


Glu 


Pro 


He 


Glu Leu Ala Glu 


145 










150 










155 




160 


His 


Leu 


Thr 


Phe 


Leu 


Glu 


His 


Lys 


Ser 


Phe 


Arg 


Arg 


He Ser Phe Thr 










165 










170 






175 


Asp 


Tyr 


Gin 


Ser 


Tyr 


Val 


He 


His 


Gly 


Cys 


Leu 


Glu 


Asn Asn Pro Thr 








180 










185 








190 


Leu 


Glu 


Arg 


Ser 


lie 


Ala 


Leu 


Phe 


Asn 


Gly 


He 


Ser 


Lys Trp Val Gin 






195 










200 










205 


Leu 


Met 


Val 


Leu 


Ser 


Lys 


Pro 


Thr 


Pro 


Gin 


Gin 


Arg 


Ala Glu Val He 




210 










215 










220 




Thr 


Lys 


Phe 


He 


Asn 


Val 


Ala 


Lys 


Lys 


Leu 


Leu 


Gin 


Leu Lys Asn Phe 


225 










230 










235 




240 


Asn 


Asn 


Leu 


He 


Ala 


He 


Val 


Gly 


Ala 


Leu 


Ser 


His 


Arg Ser He Ser 










245 










250 






255 


Gly 


Phe 


Lys 


Gly 


Thr 


His 


Ser 


His 


Leu 


Ser 


Ser 


Glu 


Val Thr Lys Asn 








260 










265 








270 


Trp 


Asn 


Val 


Lys 


Xaa 


Gin 


Lys 


Trp 


Val 


Ser 


Ser 


Asn 


Gly Asn Tyr Cys 






275 










280 










285 


Asn 


Tyr 


Arg 


Lys 


Pro 


Phe 


Ala 


Asp 


Cys 


Asp 


Gly 


Phe 


Lys He Pro He 




290 










295 










300 




Leu 


Gly 


Val 


His 


Leu 


Lys 


Asp 


Leu 


He 


Ala 


Val 


His 


Val He Phe Pro 


305 










310 










315 




320 


Asp 


Trp 


Thr 


Glu 


Glu 


Asn 


Lys 


Val 


Asn 


He 


Val 


Lys 


Met His Gin Leu 










325 










330 






335 


Ser 


val 


Thr 


Leu 


Ser 


Glu 


Leu 


Val 


Ser 


Leu 


Gin 


Asn 


Ala Ser His His 








340 










345 








350 


Leu 


Glu 


Pro 


Asn 


Met 


Asp 


Leu 


He 


Asn 


Leu 


Leu 


Thr 


Leu Ser Leu Asp 






355 










360 










365 


Leu 


Tyr 


His 


Thr 


Glu 


Asp 


Asp 


lie 


Tyr 


Lys 


Leu 


Ser 


Leu Val Leu Glu 




370 










375 










380 




Pro 


Arg 


Asn 


Ser 


Lys 


Ser 


Gin 


Pro 


Thr 


Ser 


Pro 


Thr 


Thr Pro Asn Lys 


385 










390 










395 




400 
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Pro Val 


Val 


Pro 


Leu 


Glu 


Trp 


Ala 


Leu 


Gly 


Val Met Pro 


Lys Pro 


Asp 








405 










410 




415 




Pro Thr 


Val 


lie 


Asn 


L Y S 


His 


lie 


Arg 


Lys 


Leu Val Glu 


Ser Val 


Phe 






420 










425 






430 




Arg Asn 


Tyr 


Asp 


His 


Asp 


His 


Asp 


Gly 


Tyr 


He Ser Gin 


Glu Asp 


Phe 




435 










440 






445 






Glu Ser 


lie 


Ala 


Ala 


Asn 


Phe 


Pro 


Phe 


Leu 


Asp Ser Phe 


Cys Val 


Leu 


450 










455 








460 






Asp Lys 


Asp 


Gin 


Asp 


Gly 


Leu 


He 


Ser 


Lys 


Asp Glu Met 


Met Ala 


Tyr 


465 








470 










475 




480 


Phe Leu 


Arg 


Ala 


Lys 


Ser 


Gin 


Leu 


His 


Cys 


Gin He Gly 


Ala Pro 


Gly 








485 










490 




495 




Phe He 


His 


Asn 


Phe 


Gin 


Glu 


Met 


Thr 


Tyr 


Leu Lys Pro 


Thr Phe 


Cys 






500 










505 






510 




Glu His 


Cys 


Ala 


Gly 


Phe 


He 


Leu 


Gly 


He 


He Lys Gin 


Gly Tyr 


Lys 




515 










520 






525 






Cys Lys 


Asp 


Cys 


Gly 


Ala 


Asn 


Cys 


His 


Lys 


Gin Cys Lys 


Asp Leu 


Leu 


530 










535 








540 






Val Leu 


Ala 


Cys 


Arg 


Arg 


Phe 


Ala 


Arg 


Ala 


Pro Ser Leu 


Ser Ser 


Gly 


545 








550 










555 




560 


His Gly 


Ser 


Leu 


Pro 


Gly 


Ser 


Pro 


Ser 


Leu 


Pro Pro Ala 


Gin Asp 


Xaa 








565 










570 




575 




Val Phe 


Lys 


Phe 


Pro 


Gly 


Val 


Thr 


Ala 


Asp 


Asn Ser Asp 


Leu Asp 


Ser 






580 










585 






590 




Arg Ala 


lie 


Thr 


Leu 


Val 


Thr 


Gly 


Ser 


Ser 


Arg Lys Thr 


Ser Val 


Arg 




595 










600 






605 






Leu Gin 


Arg 


Ala 


Thr 


Thr 


Ser 


Gin 


Ala 


Thr 


Gin Thr Glu 


Pro Val 


Trp 


610 










615 








620 






Ser Glu 


Ala 


Gly 


Trp 


Gly 


Asp 


Ser 


Gly 


Ser 


His Thr Leu 


Pro Tyr 


Asn 


625 








630 










635 




640 


Arg Tyr 


Ser 


Gly 


Ser 


Leu 


His 


Lys 


Pro 


Ala 


Lys Arg His 


Lys Gly 


Phe 








645 










650 




655 




Ala He 


Trp 


Glu 


Lys 


Xaa 


Lys 


Ser 


Pro 


Gly 


Trp His Ala 


Gly Gly 


Asp 




t 


660 










665 






670 




Val Xaa 


Asn 


Pro 


Gly 


Thr 


Glu 


Phe 


Glu 


Leu 


Ala Pro Asp 


Glu Gly 


Glu 




675 










680 






685 






Lys Thr 


Thr 


Gin 


Asp 


Gly 


Glu 


Asp 


Gly 


Leu 


Thr Ser Arg 


Leu Ala 


Glu 


690 










695 








700 






Asn Leu 


Lys 


Ala 


Asn 


Asn 


Gly 


Trp 


Leu 


Leu 


Gly Gly Gly 


Lys Asn 


Lys 


705 








710 










715 




720 


Lys Leu 


Leu 


Arg 


Lys 


Ala 


Leu 


Ala 


Ser 


Gin 


Glu Val He 


Leu Glu 


Arg 








725 










730 




735 





Thr Pro 



<210> 263 
<211> 80 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (l) . . . (80) 

<223> Xaa = X or * as defined in Table 6 



<400> 263 
Arg Pro Lys Met Gly Arg Arg Ser 

1 5 
Gin Val Ser Ser Leu Ser Asn Pro 
20 

Cys Phe Phe Xaa Leu Tyr Phe Ser 



Lys His Lys Pro Pro Ala Ser Phe 

10 15 
Gly Phe Phe Phe Phe He Xaa His 

25 30 
Tyr Lys Arg Asn Val Ser Leu Xaa 
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35 40 



Lys 


lie 


Thr 


His 


Tyr Arg 


Lys 


He 


Leu 




50 










55 






Glu 


Thr 


Lys 


Phe 


Phe 


Pro 


Met 


Lys 


Thr 


65 










70 










<210> ; 


264 














<211> ( 


544 














<212> PRT 














<213> Homo 


sapiens 










<400> 264 












Met 


Pro 


Ala 


Pro 


Arg 


Ala 


Arg 


Glu 


Gin 


1 








5 










Gin 


Pro 


Leu 


Leu 


Pro 


Arq 


Gly 


Ala 


Arg 








20 










25 


Ala 


Ala 


Gly 


Ala 


Ala 


Val 


Leu 


Leu 


Val 






35 










40 




Phe 


Phe 


Gly 


Val 


Thr 


Ala 


Asn 


Leu 


Val 




50 










55 






Phe 


Asn 


Trp 


Thr 


Glv 


Glu 


Gin 


Ala 


Thr 


65 










70 








Gly 


Ala 


Ser 


Tyr 


Leu 


Leu 


Ala 


Pro 


Val 








85 










Tyr 


Leu 


Gly 


Arg 


Tyr 


Arg 


Ala 


Val 


Ala 








100 










105 


Ala 


Ala 


Ser 


Gly 


Leu 


Leu 


Pro 


Ala 


Thr 






115 










120 




Ser 


Phe 


Cys 


Gly 


Glu 


Met 


Pro 


Ala 


Ser 




130 










135 






Ser 


Ala 


Gly 


Cys 


Pro 


Arg 


Ser 


Ser 


Pro 


145 










150 








Leu 


Tyr 


Ala 


Gly 


Leu 


Leu 


Leu 


Leu 


Gly 










165 










Ser 


Asn 


Leu 


Thr 


Ser 


Phe 


Gly 


Ala 


Asp 








180 










185 


Asp 


Ala 


Thr 


Arg 


Arg 


Phe 


Phe 


Asn 


Trp 






195 










200 




Gly 


Ala 


Val 


Leu 


Ser 


Leu 


Leu 


Val 


Val 




210 










215 






Ser 


Phe 


Leu 


Leu 


Gly 


Tyr 


Ser 


He 


Pro 


225 










230 








Phe 


Phe 


He 


Phe 


Leu 


Phe 


Ala 


Thr 


Pro 










245 










Met 


Gly 


Ser 


Gin 


Val 


Ser 


Ser 


Met 


Leu 








260 










265 


Cys 


Pro 


Gin 


Leu 


Trp 


Gin 


Arg 


His 


Ser 






275 










280 




Gly 


Gin 


Gin 


Gly 


Asn 


Asn 


Gly 


Ser 


Glu 




290 










295 






Ala 


Lys 


Trp 


Gin 


His 


Thr 


Leu 


Gly 


Arg 


305 










310 








Phe 


Gly 


Asp 


Asp 


Tyr 


Thr 


Asn 


He 


Val 










325 










Ser 


Ala 


Arg 


Leu 


Leu 


Asp 


Lys 


Lys 


Arg 








340 










345 


Leu 


Ala 


Asp 


Glu 


Arg 


Ser 


Pro 


Gin 


Pro 






355 










360 




He 


Ala 


Asn 


Phe 


Gin 


Val 


Leu 


Val 


Lys 
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45 








Arg Arg 


Lys 


Thr 


Phe 


Thr 


Ser 




60 










Glu Pro 


Lys 


Arg 


Val 


Ser 


Gly 


75 










80 


Pro Arg 


Val 


Pro 


Gly 


Glu 


Ara 


10 








15 




Gly Pro 


Arq 


Arq 


Trp 


Arq 


Arq 








30 






Glu Met 


Leu 


Glu 


Arq 


Ala 


Ala 






45 








Leu Tyr 


Leu 


Asn 


Ser 


Thr 


Asn 




60 










Arg Ala 


Ala 


Leu 


Val 


Phe 


Leu 


75 










80 


Gly Gly 


Trp 


Leu 


Ala 




Val 


90 








95 




Leu Ser 


Leu 


Leu 


Leu 


Tvr 


Leu 








110 






Ala Phe 


Pro 


Asp 


Glv 


Arcr 


Ser 






125 








Pro Leu 


Glv 


Pro 


Ala 


Cys 


Pro 




140 










Ser Pro 


Tvx 
j. jr *. 


Cys 


Ala 


Pro 


Val 


155 










160 


Leu Ala 


Ala 


Ser 


Ser 


Val 


Ara 


170 








175 




Gin Val 


Met 


Aso 


Leu 


Glv 


Arcr 








190 






Phe Tyr 


Trp 


Ser 


He 


Asn 


Leu 






205 








Ala Phe 


He 


Gin 


Gin 


Asn 


He 




220 










Val Gly 


Cys 


Val 


Gly 


Leu 


Ala 


235 










240 


Val Phe 


He 


Thr 


Lys 


Pro 


Pro 


250 








255 




Lys Leu 


Ala 


Leu 


Gin 


Asn 


Cys 








270 






Ala Arg 


Ser 


Lys 


Leu 


Ser 


Gin 






285 








Ser Lys 


Leu 


His 


Leu 


Leu 


Val 




300 










Val Glu 


Leu 


Thr 


Val 


Ala 


Val 


315 










320 


Pro Phe 


Gly 


He 


Ser 


Lys 


Asp 


330 








335 




Asp Arg 


Gin 


Cys 


Ala 


Arg 


Val 








350 






Gly Ala 


Ser 


Pro 


Gin 


Glu 


Asp 






365 








He Leu 


Pro 


Val 


Met 


Val 


Thr 
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370 










375 










380 




Leu 


Val' 


Pro Tyr 


Trp 


Met 


Val 


Tyr 


Phe 


Gin 


Met 


Gin 


Ser Thr Tyr Val 


385 










390 










395 




400 


Leu 


Gin 


Gly Leu 


His 


Leu 


His 


He 


Pro 


Asn 


He 


Phe 


Pro Ala Asn Pro 










405 










410 






415 


Ala 


Asn 


lie 


Ser 


Val 


Ala 


Leu 


Arg 


Ala 


Gin 


Gly 


Ser 


Ser Tyr Thr He 








420 










425 








430 


Pro 


Glu 


Ala 


Trp 


Leu 


Leu 


Leu 


Ala 


Asn 


Val 


Val 


Val 


Val Leu He Leu 






435 










440 










445 


Val 


Pro 


Leu Lys 


Asp Arg 


Leu 


He 


Asp 


Pro 


Leu 


Leu 


Leu Arg Cys Lys 




450 










455 










460 




Leu 


Leu 


Pro 


Ser 


Ala 


Leu 


Gin 


Lys 


Met 


Ala 


Leu 


Gly 


Met Phe Phe Gly 


465 










470 










475 




480 


Phe 


Thr 


Ser 


Val 


He 


Val 


Ala 


Gly 


val 


Leu 


Glu 


Met 


Glu Arg Leu His 










485 










490 






495 


Tyr 


lie 


His 


His 


Asn 


Glu 


Thr 


Val 


Ser 


Gin 


Gin 


He 


Gly Glu Val Leu 








500 










505 








510 


Tyr 


Asn 


Ala 


Ala 


Pro 


Leu 


Ser 


He 


Trp 


Trp 


Gin 


He 


Pro Gin Tyr Leu 






515 










520 










525 


Leu 


lie 


Gly 


He 


Ser 


Glu 


He 


Phe 


Ala 


Ser 


He 


Pro 


Gly Leu Glu Phe 




530 










535 










540 




Ala 


Tyr 


Ser 


Glu 


Ala 


Pro 


Arcr 


Ser 


Met 


Gin 


Gly 


Ala 


He Met Gly He 


545 










550 










555 




560 


Phe 


Phe 


Cys 


Leu 


Ser Gly 


Val 


Gly 


Ser 


Leu 


Leu 


Glv 


Ser Ser Leu Val 










565 










570 






575 


Gly 


Thr 


Ala 


Val 


Pro 


Leu 


Pro 


Gly 


Gly 


Trp 


Leu 


His 


Cys Pro Lys Asp 








580 










585 








590 


Phe 


Gly 


Asn 


He 


Asn 


Asn 


Cys 


Arg 


Met 


Asp 


Leu 


Tyr 


Phe Phe Leu Leu 






595 










600 










605 


Ala 


Gly 


lie 


Gin 


Ala 


Val 


Thr 


Ala 


Leu 


Leu 


Phe 


Val 


Trp He Ala Gly 




610 










615 










620 


Arg 


Tyr 


Glu Arg 


Ala 


Ser 


Gin 


Gly 


Pro 


Ala 


Ser 


His 


Ser Arg Phe Ser 



625 630 635 640 

Arg Asp Arg Gly 



<210> 265 

<211> 99 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (99) 

<223> Xaa s x or * as defined in Table 6 



<400> 265 



Xaa 


Ala 


Cys 


Ser 


Gly 


Val 


Pro Gly 


Thr 


Lys 


Cys 


Ser 


Pro Pro Ser Gly 


1 








5 








10 






15 


Ser 


Gly 


Tyr 


Pro 


Asn 


Pro 


Tyr Ser 


Lys 


His 


Val 


Leu 


Thr Glu Asp He 








20 








25 








30 


Val 


His 


Arg 


Glu 


Val 


Thr 


Pro Asp 


Gin 


Lys 


Leu 


Leu 


Ser Arg Ala Thr 






35 








40 










45 


Leu 


Thr 


Lys 


Thr 


Asn 


Arg 


Asn Ala 


His 


Ala 


Gly 


Pro 


Glu Arg Leu Phe 




50 










55 








60 




Pro 


Ala 


Asn 


Val 


Ala 


His 


Ser Val 


Tyr 


Val 


Leu 


Glu 


Asp Ser He Val 


65 










70 








75 




80 


Asp 


Pro 


Gin 


Asn 


Gin 


Thr 


Leu Thr 


Thr 


Phe 


Asn 


Trp 


Asn He Asn Pro 










85 








90 






95 


Arg 


Pro 


Gly 
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<210> 266 
«c211> 840 
<212> PRT 

<213> Homo sapiens 



<400> 266 



Ser 


Ser 


Leu 


Thr 


Ser 


Ser 


Met 


Glu 


Asp 


Pro 


Ala 


Ala 


Pro 


Gly 


Thr 


Gly 


1 








5 










10 










15 




Gly 


Pro 


Pro 


Ala 


Asn 


Gly 


Asn 


Gly 


Asn 


Gly 


Gly 


Gly 


Lys 


Gly 


Lys 


Gin 








20 










25 










30 






Ala 


Ala 


Pro 


Lys 


Gly 


Arg 


Glu 


Ala 


Phe 


Arg 


Ser 


Gin 


Arg 


Arg 


Glu 


Ser 






35 










40 










45 








Glu 


Gly 


Ser 


Val 


Asp 


Cys 


Pro 


Thr 


Leu 


Glu 


Phe 


Glu 


Tyr 


Gly 


Asp 


Ala 




50 










55 










60 










Asp 


Gly 


His 


Ala 


Ala 


Glu 


Leu 


Ser 


Glu 


Leu 


Tyr 


Ser 


Tyr 


Thr 


Glu 


Asn 


65 










70 










75 










80 


Leu 


Glu 


Phe 


Thr 


Asn 


Asn 


Arg 


Arcr 


Cvs 


Phe 


Glu 


Glu 


ASP 


Phe 


Lvs 


Thr 










85 










90 










95 




Gin 


Val 


Gin 


Gly 


Lys 


Glu 


Trp 


Leu 


Glu 


Leu 


Glu 


Glu 


Asp 


Ala 


Gin 


Lvs 








100 










105 










110 






Ala 


Tyr 


He 


Met 


Gly 


Leu 


Leu 


Asp 


Arcr 


Leu 


Glu 


Val 


Val 


Ser 


Arcr 


Glu 






115 










120 










125 








Arg 


Arg 


Leu 


Lys 


Ala 


Ala 


Arg 


Ala 


val 


Leu 


Tyr 


Leu 


Ala 


Gin 


Glv 


Thr 




130 










135 










140 










Phe 


Gly 


Glu 


Cys 


Asp 


Ser 


Glu 


Val 


Asp 

* XT 


Val 


Leu 


His 


Trp 


Ser 


Arcr 


Tvr 


145 










150 










155 










160 


Asn 


Cys 


Phe 


Leu 


Leu 


Tyr 


Gin 


Met 


Gly 


Thr 


Phe 


Ser 


Thr 


Phe 


Leu 


Glu 










165 










170 










175 




Leu 


Leu 


His 


Met 


Glu 


He 


Asp 


Asn 


Ser 


Gin 


Ala 


Cys 


Ser 


Ser 


Ala 


Leu 








180 










185 










190 






Arg 


Lys 


Pro 


Ala 


Val 


Ser 


lie 


Ala 


Asp 


Ser 


Thr 


Glu 


Leu 


Arg 


Val 


Leu 






195 










200 










205 








Leu 


Ser 


Val 


Met 


Tyr 


Leu 


Met 


Val 


Glu 


Asn 


He 


Arg 


Leu 


Glu 


Arg 


Glu 




210 










215 










220 










Thr 


Asp 


Pro 


Cys 


Gly 


Trp 


Arg 


Thr 


Ala 


Arg 


Glu 


Thr 


Phe 


Arg 


Thr 


Glu 


225 










230 










235 










240 


Leu 


Ser 


Phe 


Ser 


Met 


His 


Asn 


Glu 


Glu 


Pro 


Phe 


Ala 


Leu 


Leu 


Leu 


Phe 










245 










250 










255 




Ser 


Met 


Val 


Thr 


Lys 


Phe 


Cys 


Ser 


Gly 


Leu 


Ala 


Pro 


His 


Phe 


Pro 


He 








260 










265 










270 






Lys 


Lys 


Val 


Leu 


Leu 


Leu 


Leu 


Trp 


Lys 


Val 


Val 


Met 


Phe 


Thr 


Leu 


Gly 






275 










280 










285 








Gly 


Phe 


Glu 


His 


Leu 


Gin 


Thr 


Leu 


Lys 


Val 


Gin 


Lys 


Arg 


Ala 


Glu 


Leu 




290 










295 










300 










Gly 


Leu 


Pro 


Pro 


Leu 


Ala 


Glu 


Asp 


Ser 


He 


Gin 


Val 


Val 


Lys 


Ser 


Met 


305 










310 










315 










320 


Arg 


Ala 


Ala 


Ser 


Pro 


Pro 


Ser 


Tyr 


Thr 


Leu 


Asp 


Leu 


Gly 


Glu 


Ser 


Gin 










325 










330 










335 




Leu 


Ala 


Pro 


Pro 


Pro 


Ser 


Lys 


Leu 


Arg 


Gly 


Arg 


Arg 


Gly 


Ser 


Arg 


Arg 








340 










345 










350 






Gin 


Leu 


Leu 


Thr 


Lys 


Gin 


Asp 


Ser 


Leu 


Asp 


He 


Tyr 


Asn 


Glu 


Arg 


Asp 






355 










360 










365 








Leu 


Phe 


Lys 


Thr 


Glu 


Glu 


Pro 


Ala 


Thr 


Glu 


Glu 


Glu 


Glu 


Glu 


Ser 


Ala 




370 










375 










380 










Gly 


Asp 


Gly 


Glu 


Arg 


Thr 


Leu 


Asp 


Gly 


Glu 


Leu 


Asp 


Leu 


Leu 


Glu 


Gin 


385 










390 










395 










400 


Asp 


Pro 


Leu 


Val 


Pro 


Pro 


Pro 


Pro 


Ser 


Gin 


Ala 


Pro 


Leu 


Ser 


Ala 


Glu 










405 










410 










415 




Arg 


Val 


Ala 


Phe 


Pro 


Lys 


Gly 


Leu 


Pro 


Trp 


Ala 


Pro 


Lys 


Val 


Arg 


Gin 








420 










425 










430 
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Lys 


Asp 


He 


Glu 


His Phe Leu Glu 


Met 






435 




440 




Phe 


Thr 


Leu 


Gly 


Gin Asp Thr Asp 


Thr 




450 






"455 




He 


His 


Glu 


Ser 


Val Lys Thr Leu 


Lys 


465 








470 




Ala 


Asp 


Val 


Gin 


He Lys Asn Glu 


Glu 










485 




Ser 


Leu 


Gly 


Glu 


Glu Val Val Pro 


Glu 








500 




505 


Gin 


Gly 


Met 


Leu 


Tyr Ser Leu Pro 


Gin 






515 




520 




He 


Leu 


Leu 


Ala 


Ala Ala Pro Thr 


Ser 




530 






535 




Asn 


tie 


Leu 


Ala 


Asp Val Leu Pro 


Glu 


545 








550 




Gin 


Ser 


Met 


Lys 


Leu Gly lie Asp 


Val 










565 




Val 


Lys 


Ser 


He 


Ser Thr Leu Leu 


Leu 








580 




585 


Leu 


Asn 


His 


He 


Tyr Gin Phe Glu 


Tyr 






595 




600 




Ala 


Asn 


Cys 


He 


Pro Leu He Leu 


Lys 




610 






615 




Ser 


Tyr 


He 


Thr 


Ala Lys Asn Ser 


He 


625 








630 




Cys 


Thr 


He 


Gin 


Asp Leu Pro Glu 


Leu 










645 




Gly 


Asp 


Asn 


Ser 


Gin Phe Cys Trp 


Arg 








660 




665 


Leu 


Leu 


Arg 


Leu 


Leu Asn Lys Leu 


Thr 






675 




680 




Met 


Met 


Leu 


Val 


Val Phe Lys Ser 


Ala 




690 






695 




Lys 


Val 


Lys 


Gin 


Ala Met Leu Gin 


Leu 


705 








710 




Leu 


Gin 


Thr 


Lys 


Tyr Leu Gly Arg 


Gin 










725 




Thr 


Met 


Ser 


Ala 


He Tyr Gin Lys 


Val 








740 




745 


Trp 


Ala 


Tyr 


Gly 


Asn Asp He Asp 


Ala 






755 




760 




Glu 


Glu 


Cys 


Thr 


Leu Arg Ala Asn 


lie 




770 






775 




Tyr 


Asp 


Arg 


Pro 


Gin Asp Ser Glu 


Phe 


785 








790 




Gin 


Ser 


Val 


Leu 


Gly Gin Arg Leu 


Asp 










805 




Ser 


Tyr 


Glu 


Leu 


Trp Leu Glu Arg 


Glu 








820 




825 


Trp 


Glu 


Glu 


Leu 


Leu Gin Asn His 








835 




840 





<210> 267 
<211> 308 
<212> PRT 
<213> Homo sapiens 



<400> 267 
Pro Ala Trp Asn Ala Arg Pro Arg Gin 
1 5 
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Ser 


Arg 


Asn 


Lys 


Phe 


He 


Gly 








445 








Leu 


Val 


Gly 


Leu 


Pro 


Arg 


Pro 






460 










Gin 


His 


Lys 


Tyr 


He 


Ser 


He 




475 










480 


Glu 


Leu 


Glu 


Lys 


Cys 


Pro 


Met 


490 










495 




Thr 


Pro 


Cys 


Glu 


He 


Leu 


Tyr 










510 






Tyr 


Met 


He 


Ala 


Leu 


Leu 


Lys 








525 








Lys 


Ala 


Lys 


Thr 


Asp 


Ser 


He 






540 










Glu 


Met 


Pro 


He 


Thr 


Val 


Leu 




555 










560 


Asn 


Arg 


His 


Lys 


Glu 


He 


He 


570 










575 




Leu 


Leu 


Leu 


Lys 


His 


Phe 


Lys 










590 






Val 


Ser 


Gin 


His 


Leu 


Val 


Phe 








605 








Phe 


Phe 


Asn 


Gin 


Asn 


He 


Leu 






620 










Ser 


Val 


Leu 


Asp 


Tvr 


Pro 


Cvs 




635 










640 


Thr 


Thr 


Glu 


Ser 


Leu 


Glu 


Ala 


650 










655 




Asn 


Leu 


Phe 


Ser 


Cys 


He 


Asn 










670 






Lys 


Trp 


Lvs 


His 


Ser 


Arcr 


Thr 








685 








Pro 


He 


Leu 


LVS 


Arcr 


Ala 


Leu 






700 










Tyr 


Val 


Leu 


Lvs 


Leu 


Leu 


Lys 




715 










720 


Trp 

tr 


Arcr 


Lvs 


Ser 


Asn 


Met 


Lvs 


730 










735 




Arg 


His 


Arg 


Met 


Asn 


Asp 


Asp 










750 






Arg 


Pro 


Trp 

tr 


Asp 


Phe 


Gin 


Ala 








765 








Glu 


Ala 


Phe 


Asn 


Ser 


Arg 


Arg 






780 










Ser 


Pro 


Val 


Asp 


Asn 


Cys 


Leu 




795 










800 


Leu 


Pro 


Glu 


Asp 


Phe 


His 


Tyr 


810 










815 




Val 


Phe 


Ser 


Gin 


Pro 


He 


Cys 










830 






Val 
i n 


Asp 


Leu 


Lys 


Leu 


Thr 

1 c. 


His 
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Lys 


Lys 


Gin Arg 


Ala 


Leu 


Leu 


Glu 


Arg 








20 










25 


Pro 


Lys 


Asp 


Leu 


Thr 


Gin 


Pro 


Thr 


Tyr 






35 










40 




Cys 


Cys 


Cys 


Leu 


Phe 


He 


Leu 


Phe 


Leu 




50 










55 






Phe 


He 


Thr 


Thr 


Glu 


Val 


Val 


Asn 


Glu 


65 










70 








Lys 


Asp 


Ser Gly 


Gly 


Lys 


He 


Asp 


Val 










85 










Asn 


Leu 


His 


Cys 


Glu 


Leu 


Val 


Gly 


Leu 








100 










105 


Arg 


His 


Glu 


Val 


Gly 


His 


He 


Asp 


Asn 






115 










120 




Asn 


Gly 


Ala Gly 


Cys 


Arg 


Phe 


Glu 


Gly 




130 










135 






Pro 


Gly 


Asn 


Phe 


His 


Val 


Ser 


Thr 


His 


145 










150 








Asn 


Pro 


Asp 


Met 


Thr 


His 


Val 


He 


His 










165 










Leu 


Gin 


Val 


Gin 


Asn 


He 


His 


Gly 


Ala 








180 










185 


Asp 


Arg 


Leu 


Thr 


Ser 


Asn 


Pro 


Leu 


Ala 






195 










200 




He 


Val 


Pro 


Thr 


Val 


Tyr 


Glu 


Asp 


Lys 




210 










215 






Tyr 


Gin 


Tyr 


Thr 


Val 


Ala 


Asn 


Lys 


Glu 


225 










230 








Gly 


Arg 


He 


He 


Pro 


Ala 


He 


Trp 


Phe 










245 










Thr 


Val 


Lys 


Tyr 


Thr 


Glu 


Arg 


Arg 


Gin 








260 










265 


Thr 


He 


Cys 


Ala 


lie 


He 


Gly 


Gly 


Thr 






275 










280 




Asp 


Ser 


Cys 


lie 


Phe 


Thr 


Ala 


Ser 


Glu 




290 










295 






Gly 


Lys 


Met 


His 












305 




















<210> 268 














<211> 162 














<212> PRT 














<213> Homo 


sapiens 










<400> 268 












Met 


Leu 


He 


Tyr 


Ser 


Ser 


Lys 


Thr 


Leu 


l 








5 










Thr 


Pro 


Ser 


Asn 


Leu 


Trp 


Gly 


Gly 


Gin 








20 










25 


Arg 


Phe 


Cys 


Ser 


Phe 


Asp 


Gly 


Ala 


Asn 






35 










40 




Glu 


Val 


Glu 


Ser 


Asn 


Ser 


Pro 


Ala 


Ala 




50 










55 






Ser 


Asp 


Tyr 


He 


He 


Gly 


Ala 


Asp 


Thr 


65 










70 








Leu 


Phe 


Ser 


Leu 


He 


Glu 


Thr 


His 


Glu 










85 










Val 


Tyr 


Asn 


Thr 


Asp 


Thr 


val 


Tyr 


Thr 








100 










105 


Cys 


Val 


Lys 


Ser 


Ser 


Tyr 


Ser 


Gly 


Ala 






115 










120 
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Phe 


Asp 


He 


Tyr 


Arg 


Lys 


Val 










30 






Thr 


Gly 


Ala 


He 


He 


Ser 


He 








45 








Phe 


Leu 


Ser 


Glu 


Leu 


Thr 


Gly 






60 










Leu 


Tyr 


Val 


Asp 


Asp 


Pro 


Asp 




75 










80 


Ser 


Leu 


Asn 


He 


Ser 


Leu 


Pro 


90 










95 




Asp 


He 


Gin 


Asp 


Glu 


Met 


Gly 










110 






Ser 


Met 


Lys 


He 


Pro 


Leu 


Asn 








125 








Gin 


Phe 


Ser 


He 


Asn 


Lys 


Val 






140 










Ser 


Ala 


Thr 


Ala 


Gin 


Pro 


Gin 




155 










160 


Lys 


Leu 


Ser 


Phe 


Gly 


Asp 


Thr 


170 










175 




Phe 


Asn 


Ala 


Leu 


Gly 


Gly 


Ala 










190 






Ser 


His 


Asp 


Tyr 


He 


Leu 


Lys 








205 








Ser 


Gly 


Lys 


Gin 


Arg 


Tyr 


Ser 






220 










Tyr 


Val 


Ala 


Tyr 


Ser 


His 


Thr 




235 










240 


Arg 


Tyr 


Asp 


Leu 


Ser 


Pro 


He 


250 










255 




Pro 


Leu 


Tyr 


Arg 


Phe 


He 


Thr 










270 






Phe 


Thr 


Val 


Ala 


Gly 


He 


Leu 








285 








Ala 


Trp 


Lys 


Lys 


He 


Gin 


Leu 






300 










Glu 


Leu 


Arg 


Glu 


Thr 


Ser 


val 


10 










15 




Gly 


Leu 


Leu 


Gly 


Val 


Ser 


He 










30 






Glu 


Asn 


Val 


Trp 


His 


Val 


Leu 








45 








Leu 


Ala 


Gly 


Leu 


Arg 


Pro 


His 






60 










Val 


Met 


Asn 


Glu 


Ser 


Glu 


Asp 




75 










80 


Ala 


Lys 


Pro 


Leu 


Lys 


Leu 


Tyr 


90 










95 




Gly 


Asn 


Ser 


Thr 


Trp 


Lys 


Thr 










110 






Leu 


Val 


Asn 


Leu 


Asn 


Arg 


Leu 



125 
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Phe Ser Ser Ala Tyr Thr Arg lie Pro Cys Phe Gly Ala Leu Arg He " 

130 135 140 

Asn Ser Asp Lys His Phe Val Asn Gly Cys Trp Leu Leu Ser Thr Tyr 
145 150 155 160 

Thr Leu 



<210> 269 
<211> 280 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> (1) . . . (280) 

<223> Xaa = X or * as defined in Table 6 



<400> 269 



Asn 


Leu 


Leu 


Leu Gly Gly 


Gly Gly 


1 






5 




Ara 


Glv 


Pro 


Phe Pro Glv 


Leu Ser 








20 




Pro 


Lys 


Arcr 


Pro Gly Cys 


Ser His 






35 




40 


Met 


Gly 


Leu 


Pro Lys He 


His His 




50 






55 


Gly 


Val 


Phe 


Leu Glu Val 


Asp Phe 


65 






70 




Glu 


Phe 


Phe 


Ser Val Asp 


Asp Asn 








85 




Met 


Lys 


Glu 


Asp Ser Asp 


Glu Asn 








100 




Glu 


Glu 


Met 


Glu Glu Gin 


Ser Gin 






115 




120 


Thr 


Val 


Pro 


Gly Pro Ala 


Ser Glu 




13 0 






135 


Glu 


Asp 


Phe 


Ala Lys Asp 


Glu Asn 


145 






150 




Asp 


His 


Ser 


Val Arg Asp 


Gin Asp 








165 




Ser 


Thr 


Lys 


Asn Glu He* 


Lys He 








180 




Met 


Glu 


Thr 


Glu Glu Leu 


Ser Ser 






195 




200 


Glu 


Gin 


Pro 


Glu Val He 


Pro Leu 




210 






215 


Gly 


Glu 


Lys 


Ala Pro Gly 


Glu Asp 


225 






230 




Glu 


Gly 


Ser 


Ser Asp Leu 


Glu Asn 








245 




Asn 


Ser 


Gin 


Val Asp Lys 


Val Asn 








260 




Cys 


Gin 


Ala 


Xaa Arg Ser 


Arg Gly 






275 




280 



Lys 


Lys 


Lys 


Lys 


Pro Pro Arg 


Thr 




10 






15 




Gin 


Pro 


Glv 


Leu 


Leu Trp Leu 


Phe 


25 








30 




Leu 


Pro 


Ser 


Thr 


Pro lie Lys 


Glu 










45 




Arg 


Val 


Gly 


Trp 


Glu Ser Phe 


Ser 








60 






Lys 


He 


Tyr 


Lys 


Lys Lys Met 


Asn 






75 






80 


Asn 


Glu 


Glu 


Glu 


Glu Asp Val 


Glu 




90 






95 




Gly 


Pro 


Glu 


Glu 


Lys Gin Ser 


Val 


105 








110 




Asp 


Ala 


Asp 


Gly 


Val Asn Thr 


Val 










125 




Glu 


Ala 


Val 


Glu 


Asp Cys Lys Asp 








140 






He 


Thr 


Lys 


Gly 


Gly Glu Val 


Thr 






155 






160 


His 


Pro 


Asp 


Gly 


Gin Glu Asn Asp 




170 






175 




Glu 


Thr 


Glu 


Ser 


Gin Ser Ser 


Tyr 


185 








190 




Asn 


Gin 


Glu 


Asp 


Ala Val He 


Val 










205 




Thr 


Glu 


Asp 


Gin 


Glu Glu Lys 


Glu 








220 






Thr 


Pro 


Arg 


Met 


Pro Gly Lys 


Ser 






235 






240 


Thr 


Pro 


Gly 


Pro 


Asp Val Glu 


Met 




250 






255 




Asp 


Pro 


Thr 


Glu 


Ser Gin Pro 


Ser 


265 








270 





<210> 270 
<211> 160 
<212> PRT 
<213> Homo sapiens 
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<220> 

<221> misc_feature 
<222> (1}...(160) 

<223> Xaa = X or * as defined in Table 6 



<400> 270 



Glu Arg 


Arg 


Glu 


Arg Ser 


Pro Asp 


Gin Ser Ser 


Gly Arg Ala 


Ser Arg 


1 






5 




10 




15 


Gly Pro 


Pro 


Glu 


Arg Gin 


Ser Leu 


Arg Met Ser 


Pro Ser Arg 


Ala Ala 






20 






25 


30 




Trp Thr 


Ser 


Ser 


Pro Cys 


Arg Ser 


Cys Ala Ser 


Gin Gly Val 


Cys Ala 




35 






40 




45 




Trp Pro 


Leu 


Asn 


Leu Arg 


Arg He 


Ala Ser Thr 


Ser Trp Cys 


Xaa Pro 


50 








55 




60 




Met Ser 


Ala 


Gly 


He Gly 


Pro Met 


Ala Trp Trp 


Pro Ser Thr 


Thr Gly 


65 






70 




75 




80 


Pro Cys 


Met 


Met 


Ser Thr 


Val Ser 


Thr Met Ala 


Lys Pro His 


Arg Glu 








85 




90 




95 


Cys Pro 


Gly 


Cys 


Phe Val 


Pro Phe 


Ala Val Cys 


Val Val Ser 


Arg Phe 






100 






105 


110 




Pro Tyr 


Tyr 


Asn 


Ser Leu 


Lys Asp 


Cys Leu Ser 


Trp His Tyr 


Arcr Arcr 




115 






120 




125 




Pro Gly 


Ala 


Thr 


Leu Leu 


Ser Pro 


Ser Ser Leu 


Val Thr Leu 


Tj#>!i Tjpii 

UCU J— I ^ L4. 


130 








135 




140 




Val Lys 


Gly 


Pro 


Gly Ala 


Ala Ala 


Ala Asp Ala 


Gly Glu He 


Pro Val 


145 






150 




155 




160 


<210> 271 












<211> 132 












<212> PRT 












<213> Homo 


sapiens 










<400> 271 












Ala Ser 


Ala 


Pro 


Val Gly 


Cys Leu 


Thr Arg Ala 


Val Cys Gly 


Arg Pro 


1 






5 




10 




15 


Pro Trp 


Arg 


Thr 


Asn Thr 


Val Val 


Glu Pro Arg 


Glu Gly Thr 


Arg He 






20 






25 


30 




Leu Glu 


Phe 


Gly 


His Leu 


Lys Leu 


Ala His Val 


Pro Pro Leu 


Glu Phe 




35 






40 




45 




Leu Val 


Asn 


Gin 


His Gin 


Pro Glu 


Asp His Val 


Leu He Lys 


Arg Trp 


50 








55 




60 




Lys , Glu 


Glu 


Lys 


Leu Glu 


Pro Ala 


Trp Glu Gly 


Pro Tyr Pro 


Val Leu 


65 






70 




75 




80 


Leu Thr 


Thr 


Lys 


Thr Ala 


Val Arg 


Thr Asp Lys 


Lys Lys Lys 


Lys Lys 








85 




90 




95 


Lys Lys 


Arg 


Trp 


Thr His 


His Thr 


Gin Val Lys 


Lys Val Pro 


Pro Pro 






100 






105 


110 




Pro Glu 


Ser 


Trp 


Ala He 


Val Pro 


Gly Glu Asn 


Pro Thr Lys 


Leu Lys 




115 






120 




125 




Leu Arg 


Lys 


Met 












130 

















<210> 272 
<211> 1262 
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<213> Homo sapiens 



<400> 272 



Met 


Arg 


Arg 


Gly 


Gly 


Trp 


Arg 


Lys 


Arg 


Ala 


Glu 


Asn 


Asp 


Gly 


Trp 


Glu 


1 








5 










10 










15 




Thr 


Trp 


Gly 


Gly 


Tyr 


Met 


Ala 


Ala 


Lys 


Val 


Gin 


Lys 


Leu 


Glu 


Glu 


Gin 








20 










25 










30 






Phe 


Arg 


Ser 


Asp 


Ala 


Ala 


Met 


Gin 


Lys 


Asp 


Gly 


Thr 


Ser 


Ser 


Thr 


He 






35 










40 










45 








Phe 


Ser 


Gly 


Val 


Ala 


He 


Tyr 


Val 


Asn 


Gly 


Tyr 


Thr 


Asp 


Pro 


Ser 


Ala 




50 










55 










60 










Glu 


Glu 


Leu 


Arg 


Lys 


Leu 


Met 


Met 


Leu 


His 


Gly 


Gly 


Gin 


Tyr 


His 


Val 


65 










70 










75 










80 


Tyr 


Tyr 


Ser 


Arg 


Ser 


Lys 


Thr 


Thr 


His 


He 


He 


Ala 


Thr 


Asn 


Leu 


Pro 










85 










90 










95 




Asn 


Ala 


Lys 


He 


Lys 


Glu 


Leu 


Lys 


Gly 


Glu 


Lys 


Val 


He 


Arg 


Pro 


Glu 








100 










105 










110 






Trp 


He 


Val 


Glu 


Ser 


He 


Lys 


Ala 


Gly 


Arg 


Leu 


Leu 


Ser 


Tyr 


He 


Pro 






115 










120 










125 








Tyr 


Gin 


Leu 


Tyr 


Thr 


Lys 


Gin 


Ser 


Ser 


Val 


Gin 


Lys 


Gly 


Leu 


Ser 


Phe 




130 










135 










140 










Asn 


Pro 


Val 


Cys 


Arg 


Pro 


Glu 


Asp 


Pro 


Leu 


Pro 


Gly 


Pro 


Ser 


Asn 


He 


145 










150 










155 










160 


Ala 


Lys 


Gin 


Leu 


Asn 


Asn 


Arg 


Val 


Asn 


His 


He 


Val 


Lys 


Lys 


He 


Glu 










165 










170 










175 




Thr 


Glu 


Asn 


Glu 


Val 


Lys 


Val 


Asn 


Gly 


Met 


Asn 


Ser 


Trp 


Asn 


Glu 


Glu 








180 










185 










190 






Asp 


Glu 


Asn 


Asn 


Asp 


Phe 


Ser 


Phe 


Val 


Asp 


Leu 


Glu 


Gin 


Thr 


Ser 


Pro 






195 










200 










205 








Gly 


Arg 


Lys 


Gin 


Asn 


Gly 


He 


Pro 


His 


Pro 


Arg 


Gly 


Ser 


Thr 


Ala 


He 




210 










215 










220 










Phe 


Asn 


Gly 


His 


Thr 


Pro 


Ser 


Ser 


Asn 


Gly 


Ala 


Leu 


Lys 


Thr 


Gin 


Asp 


225 










230 










235 










240 


Cys 


Leu 


Val 


Pro 


Met 


Val 


Asn 


Ser 


Val 


Ala 


Ser 


Arg 


Leu 


Ser 


Pro 


Ala 










245 










250 










255 




Phe 


Ser 


Gin 


Glu 


Glu 


Asp 


Lys 


Ala 


Glu 


Lys 


Ser 


Ser 


Thr 


Asp 


Phe 


Arg 








250 










265 










270 






Asp 


Cys 


Thr 


Leu 


Gin 


Gin 


Leu 


Gin 


Gin 


Ser 


Thr 


Arg 


Asn 


Thr 


Asp 


Ala 






275 










280 










285 








Leu 


Arg 


Asn 


Pro 


His 


Arg 


Thr 


Asn 


Ser 


Phe 


Ser 


Leu 


Ser 


Pro 


Leu 


His 




290 










295 










300 










Ser 


Asn 


Thr 


Lys 


He 


Asn 


Gly 


Ala 


His 


His 


Ser 


Thr 


Val 


Gin 


Gly 


Pro 


305 










310 










315 










320 


Ser 


Ser 


Thr 


Lys 


Ser 


Thr 


Ser 


Ser 


Val 


Ser 


Thr 


Phe 


Ser 


Lys 


Ala 


Ala 










325 










330 








335 




Pro 


Ser 


Val 


Pro 


Ser 


Lys 


Pro 


Ser 


Asp 


Cys 


Asn 


Phe 


He 


Ser 


Asn 


Phe 








340 










345 










350 






Tyr 


Ser 


His 


Ser 


Arg 


Leu 


His 


His 


He 


Ser 


Met 


Trp 


Lys 


Cys 


Glu 


Leu 






355 










360 










365 








Thr 


Glu 


Phe 


Val 


Asn 


Thr 


Leu 


Gin 


Arg 


Gin 


Ser 


Asn 


Gly 


He 


Phe 


Pro 




370 










375 










380 










Gly 


Arg 


Glu 


Lys 


Leu 


Lys 


Lys 


Met 


Lys 


Thr 


Gly 


Arg 


Ser 


Ala 


Leu 


Val 


385 










390 










395 










400 


Val 


Thr 


Asp 


Thr 


Gly 


Asp 


Met 


Ser 


Val 


Leu 


Asn 


Ser 


Pro 


Arg 


His 


Gin 










405 










410 










415 




Ser 


Cys 


He 


Met 


His 


Val 


Asp 


Met 


Asp 


Cys 


Phe 


Phe 


Val 


Ser 


Val 


Gly 








420 










425 










430 






He 


Arg 


Asn 


Arg 


Pro 


Asp 


Leu 


Lys 


Gly 


Lys 


Pro 


Val 


Ala 


Val 


Thr 


Ser 






435 










440 










445 








Asn 


Arg 


Gly 


Thr 


Gly 


Arg 


Ala 


Pro 


Leu 


Arg 


Pro 


Gly 


Ala 


Asn 


Pro 


Gin 




450 










455 










460 










Leu 


Glu 


Trp 


Gin 


Tyr 


Tyr 


Gin 


Asn 


Lys 


He 


Leu 


Lys 


Gly 


Lys 


Ala 


Ala 


465 










470 










475 










480 
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Asp 


He 


Pro 


Asp 


Ser 


Ser 


Leu 


Trp 










485 








Asn 


Gly 


He 


Asp 


Ser 


Val 


Leu 


Ser 








500 










Tyr 


Glu 


Ala 


Arg 


Gin 


Leu 


Gly 


He 






515 










520 


Ala 


Lys 


Gin 


Leu 


Cys 


Pro 


Asn 


Leu 




530 










535 




Ala 


Tyr 


Lys 


Glu 


Val 


Ala 


Gin 


Thr 


545 










550 






Thr 


His 


Asn 


He 


Glu 


Ala 


Val 


Ser 










565 








Thr 


Glu 


He 


Leu 


Ala 


Glu 


Thr 


Lys 








580 










Ala 


Val 


Arg 


Met 


Glu 


He 


Lys 


Asp 






595 










600 


Gly 


lie 


Gly 


Ser 


Asn 


He 


Leu 


Leu 




610 










615 




Lys 


Pro 


Asp 


Gly 


Gin 


Tyr 


His 


Leu 


625 










630 






He 


Arg 


Gly 


Gin 


Leu 


Val 


Thr 


Asn 










645 








Glu 


Ser 


Lys 


Leu 


Ala 


Ser 


Leu 


Gly 








660 










Tyr 


Met 


Thr 


Met 


Ala 


Lys 


Leu 


Gin 






675 










680 


Gin 


Met 


Leu 


Tyr 


Arg 


Phe 


Cys 


Arg 




690 










695 




Thr 


Glu 


Lys 


Glu 


Arg 


Lys 


Ser 


Val 


705 










710 






Arg 


Phe 


Thr 


Gin 


Pro 


Lys 


Glu 


Ala 










725 








Glu 


Glu 


He 


Gin 


Arg 


Arg 


Leu 


Glu 








740 










Leu 


Thr 


Leu 


Lys 


He 


Met 


Val 


Arg 






755 










760 


Ala 


Lys 


Phe 


Gly 


Gly 


Kis 


Gly 


He 




770 










775 




Thr 


Leu 


Asp 


Gin 


Ala 


Thr 


Asp 


Asn 


785 










790 






Leu 


Asn 


Met 


Phe 


His 


Thr 


Met 


Lys 










805 








val 


Gly 


He 


His 


Val 


Asn 


Gin 


Leu 








820 










Thr 


Cys 


Pro 


Ser 


Arg 


Pro 


Ser 


Val 






835 










840 


Ser 


Tyr 


Ser 


Val 


Arg 


Asp 


Val 


Phe 




850 










855 




Thr 


Glu 


Glu 


Glu 


His 


Lys 


Glu 


Val 


865 










870 






He 


Ser 


Ser 


Ala 


Ser 


Arg 


Thr 


cys 










885 








His 


Leu 


Pro 


Thr 


Ser 


Pro 


Asp 


Thr 








900 










Trp 


Asn 


Gly 


Leu 


His 


Thr 


Pro 


Val 






915 










920 


Ser 


He 


Glu 


Val 


Pro 


Ser 


Pro 


Ser 




930 










935 




Ala 


Leu 


Pro 


Pro 


Asp 


Leu 


Arg 


Glu 


945 










950 






Gin 


Gin 


Ala 


Glu 


Ser 


His 


Gly 


Asp 










965 






Cys 


Asn 


Thr 


Gly 


He 


Leu 


Pro 


Gin 



980 



Glu 


Asn 


Pro 


Asp 


Ser 


Ala 


Gin 


Ala 




490 










495 




Arg 


Ala 


Glu 


He 


Ala 


Ser 


Cys 


Ser 


505 










510 






Lys 


Asn 


Gly 


Met 


Phe 


Phe 


Gly 


His 










525 








Gin 


Ala 


Val 


Pro 


Tyr 


Asp 


Phe 


His 








540 










Leu 


Tyr 


Glu 


Thr 


Leu 


Ala 


Ser 


Tyr 






555 










560 


Cys 


Asp 


Glu 


Ala 


Leu 


Val 


Asp 


He 




570 










575 




Leu 


Thr 


Pro 


Asp 


Glu 


Phe 


Ala 


Asn 


585 










590 






Gin 


Thr 


Lys 


Cys 


Ala 


Ala 


Ser 


Val 










605 








Ala 


Arg 


Met 


Ala 


Thr 


Arg 


Lys 


Ala 








620 










Lys 


Pro 


Glu 


Glu 


Val 


Asp 


Asp 


Phe 






635 










640 


Leu 


Pro 


Gly 


Val 


Gly 


His 


Ser 


Met 




650 










655 




lie 


Lys 


Thr 


Cys 


Gly 


Asp 


Leu 


Gin 


665 










670 






Lys 


Glu 


Phe 


Gly 


Pro 


Lys 


Thr 


Gly 










685 








Gly 


Leu 


Asp 


Asp 


Arg 


Pro 


Val 


Arg 








700 










Ser 


Ala 


Glu 


He 


Asn 


Tyr 


Gly 


He 






715 










72 0 


Glu 


Ala 


Phe 


Leu 


Leu 


Ser 


Leu 


Ser 




730 










735 




Ala 


Thr 


Gly 


Met 


Lys 


Gly 


Lys 


Arg 


745 










750 






Lys 


Pro 


Gly 


Ala 


Pro 


Val 


Glu 


Thr 










765 








Cys 


Asp 


Asn 


He 


Ala 


Arg 


Thr 


Val 








780 










Ala 


Lys 


He 


He 


Gly 


Lys 


Ala 


Met 






795 










800 


Leu 


Asn 


He 


Ser 


Asp 


Met 


Arg 


Gly 




810 










815 




Val 


Pro 


Thr 


Asn 


Leu 


Asn 


Pro 


Ser 


825 










830 






Gin 


Ser 


Ser 


His 


Phe 


Pro 


Ser 


Gly 










845 








Gin 


Val 


Gin 


Lys 


Ala 


Lys 


Lys 


Ser 








860 










Phe 


Arg 


Ala 


Ala 


Val 


Asp 


Leu 


Glu 






875 










880 


Thr 


Phe 


Leu 


Pro 


Pro 


Phe 


Pro 


Ala 




890 










895 




Asn 


Lys 


Ala 


Glu 


Ser 


Ser 


Gly 


Lys 


905 










910 






Ser 


Val 


Gin 


Ser 


Arg 


Leu 


Asn 


Leu 










925 








Gin 


Leu 


Asp 


Gin 


Ser 


Val 


Leu 


Glu 








940 










Gin 


Val 


Glu 


Gin 


Val 


Cys 


Ala 


Val 






955 










960 


Lys 


Lys 


Lys 


Glu 


Pro 


Val 


Asn 


Gly 




970 










975 




Pro 


Val 


Gly 


Thr 


Met 


Ser 


Leu 


Leu 


985 










990 
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Gin lie Pro Glu Pro Gin Glu Ser Asn Ser Asp Ala Gly lie Asn Leu 

995 1000 1005 

lie Ala Leu Pro Ala Phe Ser Gin Val Asp Pro Glu Val Phe Ala Ala 

1010 1015 1020 

Leu Ser Ala Glu Leu Gin Arg Glu Leu Lys Ala Ala Tyr Asp Gin Arg 
1025 1030 1035 1040 

Gin Arg Gin Gly Glu Asn Ser Thr His Gin Gin Ser Ala Ser Ala Ser 

1045 1050 1055 

Val Pro Lys Asn Pro Leu lie His Leu Lys Ala Ala Val Lys Glu Lys 

1060 1065 1070 

Lys Arg Asn Lys Lys Lys Lys Thr lie Gly Ser Pro Lys Arg He Gin 

1075 1080 1085 

Ser Pro Leu Asn Asn Lys Leu Leu Asn Ser Pro Ala Lys Thr Leu Pro 

1090 1095 1100 

Gly Ala Cys Gly Ser Pro Gin Lys Leu He Asp Gly Phe Leu Lys His 
1105 mo 1115 " 1120 

Glu Gly Pro Pro Ala Glu Lys Pro Leu Glu Lys Asn Ser Ser Gly Phe 

1125 1130 H35 

Leu Leu Ser Gly Val Pro Gly Leu Ser Ser Leu Gin Ser Asp Pro Ser 

1140 H45 1150 

Leu Gly Cys Val Arg Pro Pro Pro Pro Asn Leu Ala Gly Ala Val Glu 

1155 H60 1165 

Phe Asn Asp Val Lys Thr Leu Leu Arg Glu Trp Val Thr Thr He Ser 

1170 H75 1180 

Asp Pro Met Glu Glu Asp He Leu Gin Val Val Lys Tyr Cys Thr Asp 
1185 1190 1195 * 1200 

Leu He Glu Asp Lys Asp Leu Glu Lys Leu Asp Leu Val He Lys Tyr 

1205 1210 1215 

Met Lys Arg Leu Met Gin Gin Ser Val Glu Ser Val Trp Asn Met Ala 

1220 1225 1230 

Phe Asp Phe He Leu Asp Asn Val Gin Val Val Leu Gin Gin Thr Tyr 

1235 1240 1245 

Gly Ser His He Lys Ser Tyr He Asn He Thr Gin Arg Ala 
1250 1255 1260 



<210> 273 
<211> 260 
<212> PRT 

<213> Homo sapiens 
<220> 

<22l> misc_feature 
<222> (1) . . . (260) 

<223> Xaa = X or * as defined in Table 6 



<400> 273 



Met 


Ala 


Glu 


Thr Glu 


Glu 


Arg 


Ser 


Leu 


Asp 


Asn Phe Phe Ala Lys Arg 


1 






5 










10 


15 


Asp 


Lys 


Lys 


Lys Lys 


Lys 


Glu 


Arg 


Ser 


Asn 


Arg Ala Ala Ser Ala Ala 








20 








25 




30 


Gly 


Ala 


Ala 


Gly Ser 


Ala 


Gly 


Gly 


Ser 


Ser 


Gly Ala Ala Gly Ala Ala 






35 








40 






45 


Gly 


Gly 


Gly 


Ala Gly 


Ala 


Gly 


Thr 


Arg 


Pro 


Gly Asp Gly Gly Thr Ala 




50 








55 








60 


Ser 


Ala 


Gly 


Ala Ala 


Gly 


Pro 


Gly 


Ala 


Ala 


Thr Lys Ala Val Thr Lys 


65 








70 










75 80 


Asp 


Glu 


Asp 


Glu Trp 


Lys 


Glu 


Leu 


Glu 


Gin 


Lys Glu Val Asp Tyr Ser 








85 










90 


95 


Gly 


Leu 


Arg 


Val Gin 


Ala 


Met 


Gin 


lie 


Ser 


Ser Glu Lys Glu Glu Asp 








100 








105 




110 


Asp 


Asn 


Glu 


Lys Arg 


Gin 


Asp 


Pro 


Gly 


As P 


Asn Trp Glu Glu Gly Gly 
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115 










120 








125 








Gly 


Gly 


Gly 


Gly 


Gly 


Met 


Glu 


Lys 


Ser 


Ser 


Gly Pro Trp 


Asn 


Lys 


Thr 




130 










135 










140 








Ala 


Pro 


Val 


Gin 


Ala 


Pro 


Pro 


Ala 


Pro 


Val 


He 


Val Thr 


Glu 


Thr 


Pro 


145 










150 










155 








160 


Glu 


Pro 


Ala 


Met 


Thr 


Ser 


Gly 


Val 


Tyr 


Arg 


Pro 


Pro Gly Ala 


Arg 


Leu 










165 










170 








175 




Thr 


Thr 


Thr 


Arg 
180 


Lys 


Thr 


Pro 


Gin 


Gly 
185 


Pro 


Pro 


Glu He 


Tyr 
190 


Gin 


Xaa 


Tyr 


His 


Ser 
195 


Ser 


His 


Pro 


Leu 


Ala 
200 


Val 


Asn 


Leu 


Pro Lys 
205 


His 


Val 


Glu 


Ser 


Arg 


Lys 




Lys 


Glu 


Met 


Glu 


Lys 


Ser 


Phe 


Glu Val 


Val 


Arg His 




210 










215 










220 








Lys 


Asn 


Arg 


Gly 


Arg 


Asp 


Glu 


Val 


Ser 


Lys 


Asn 


Gin Ala 


Leu 


Lys 


Leu 


225 










230 










235 








240 


Gin 


Leu 


Asp 


Asn 


Gin 


Tyr 


Ala 


Val 


Leu 


Glu 


Asn Gin Lys 


Ser 


Ser 


His 










245 










250 








255 




Ser 


Gin 


Tyr 


Asn 

























260 



<210> 274 
<211> 122 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (122) 

<223> Xaa = X or * as defined in Table 6 



<400> 274 



His 


Leu 


Arg 


He Leu Arg 


Asp 


Ser 


Arg 


Thr 


His Ser Tyr Phe 


Leu 


Thr 


1 






5 








10 




15 




Ser 


Leu 


Arg 


Gly Glu Asn 


Asn 


Pro 


Trp 


Thr 


Asp Gin Ser Pro 


Cys Ala 








20 






25 




30 






Ala 


Ala 


Ser 


Arg Ala Gin 


His 


Leu 


His 


Pro 


Ala Ala Val Ala 


Ala 


Ala 






35 






40 






45 






Thr 


Met 


Pro 


Lys Thr Lys 


Ala 


Glu 


Gly 


Asp 


Ala Lys Gly Asp 


Lys 


Ala 




50 






55 








60 






Lys 


Val 


Lys 


Asp Glu Pro 


Gin Val 


Thr 


Arg 


Ala Ala He Gin 


Thr 


Asn 


65 






70 










75 




80 


Thr 


Phe 


He 


Phe Lys Cys 


Xaa 


He 


Glu 


Pro 


Gin Lys Gin He 


Tyr 


He 








85 








90 




95 




Leu 


Tyr 


He 


Gin Asn Ser 


Cys 


Gin 


He 


Ser 


Leu Leu He Leu 


Pro 


Lys 








100 






105 




110 






Ser 


Thr 


Leu 


Met Lys Trp 


Met 


Gin 


Thr 


Leu 









115 120 



<210> 275 
<211> 630 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (630) 

<223> Xaa = X or * as defined in Table 6 
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<400> 275 



Ser 


Ser 


Val 


Glu 


1 








His 


Asp 


Pro 


Asp 








2 0 


Pro 


Glu 


Tyr 


Ser 






35 




Pro 


Pro 


Phe 


Lys 




50 






Thr 


Lys 


He 


Ala 


65 








He 


Asp 


Arg 


Val 


Glu 


Glu 


Gly 


Asp 








100 


Leu 


Arg 


Arg 


Val 






115 




Asn 


Ser 


Asp 


He 




130 






Val 


Ser 


Gly 


Met 


145 








Cys 


Glu 


Arg 


Cys 


Tyr 


Ser 


Val 


Gin 








180 


Gly 


Thr 


Asp 


lie 






195 




Ala 


Ala 


Gly 


Gly 




210 






Thr 


val 


Gin 


Phe 


225 








He 


His 


Tyr 


Pro 


Leu 


Glu 


Ser 


Ala 








260 


Leu 


Cys 


Glu 


Thr 






275 




Ala 


Met 


Pro 


Glu 




290 






Arg 


Pro 


Tyr 


Val 


305 








Ala 


Ser 


Trp 


Gly 


He 


Asn 


Pro 


Thr 








340 


Phe 


Lys 


lie 


Val 






355 




His 


Arg 


Cys 


Ser 




370 






Pro 


Ser 


Pro 


Asp 


385 








Gin 


Tyr 


Leu 


Ala 


His 


Gly 


His 


Ser 








420 


Lys 


Glu 


Thr 


Val 






435 




Val 


Glu 


Asp 


Thr 




450 






Ala 


Pro 


Ala 


Phe 


465 








Lys 


Glu 


Ser 


Thr 


Arg 


Ser 


Tyr 


Thr 



Gin 


Ala 


Ser 


Val 


5 








Leu 


Tyr 


He 


Glu 


Glu 


Val 


Ala 


Tyr 








40 


Glu 


Pro 


He 


Leu 






55 




Gin 


Asp 


He 


Glu 




70 






Val 


Tyr 


Asp 


Leu 


85 








He 


Leu 


Lys 


Phe 


He 


Gin 


He 


Arg 








120 


Asn 


Ser 


Asn 


His 






135 




Arg 


Pro 


Gly 


Val 




150 






Asn 


Arg 


Leu 


Phe 


165 








Glu 


Ala 


Leu 


Asn 


Arg 


Tyr 


Tyr 


He 








200 


Ala 


Gin 


Arg 


Gly 






215 




Ser 


Thr 


Xaa 


Arg 




230 






Tyr 


Thr 


Tyr 


Ser 


245 








His 


Asn 


Pro 


Gin 


Leu 


Ser 


Gly 


Asn 








280 


Ser 


Asn 


Tyr 


Tyr 






295 




Leu 


Met 


Tyr 


Ala 




310 






Tyr 


Glu 


Arg 


Glu 


325 








Gly 


Phe 


Arg 


Ser 


Pro 


Met 


Leu 


Asn 








360 


Leu 


Ser 


Gly 


Glu 






375 




Leu 


His 


Pro 


Thr 




390 






Ala 


Val 


Lys 


Arg 


405 








Arg 


Lys 


Lys 


Asn 


Trp 


His 


Thr 


Asn 








440 


Gly 


Tyr 


Arg 


Thr 






455 




Cys 


Met 


Ser 


Ser 




470 






Ala 


Arg 


Val 


Val 


485 








Met 


Glu 


Ser 


Thr 



Glu 


Val Pro 


Asp 




10 




He 


Val Lys 


Asn 


25 






Pro 


Asp Tyr 


Phe 


Glu 


Arg Pro 


Tyr 






60 


Arg 


Leu He 


His 




75 




Asp 


Asn Pro 


Asn 




90 




Asn 


Ser Lys 


Phe 


105 






Lys 


Asn Glu 


Tyr 


Tyr 


His Gin 


Trp 






140 


Ala 


Tyr Arg 


Phe 




155 




Asn 


Tyr Gly 


Met 




170 




Ala 


Arg Pro 


Trp 


185 






Asn 


His Phe 


Ser 


Lys 


Ser Tyr 


Tyr 






220 


Met 


Asp Val 


Cys 




235 




Thr 


Leu Gin 


Met 




250 




Gin 


He Tyr 


Phe 


265 






Ser 


Cys Pro 


Leu 


Glu 


His He 


Cys 






300 


Arg 


Val His 


Pro 




315 




Arg 


Trp Glu 


Tyr 




330 




Leu 


Arg Arg 


Asn 


345 






Pro 


Asp Gly 


Val 


Asp 


Leu Asn 


Arg 






380 


He 


Tyr His 


Ala 




395 




Leu 


Pro Leu 


Val 




410 




Val 


Phe Met 


Tyr 


425 






Asp 


Asn Ala 


Thr 


Leu 


Pro Lys 


He 






460 


Cys 


Ser Phe 


Val 




475 




Val 


Xaa Arg 


Glu 




490 




Leu 


Cys Gly 


Cys 




170 





Gly 


Pro 


Thr 


Leu 






15 




Thr 


Lys 


Ser 


Val 




30 






Gly 


His 


He 


Pro 


45 








Gly 


Val 


Gin 


Arg 


Gin 


Ser 


Asp 


He 








80 


Tyr 


Thr 


He 


Pro 






95 




Glu 


Ser 


Gly 


Asn 




110 






Asp 


Leu 


He 


Leu 


125 








Phe 


Tyr 


Phe 


Glu 


Asn 


He 


He 


Asn 








160 


Gin 


Pro 


Leu 


Met 






175 




Trp 


He 


Arg 


Met 




190 






Arg 


Ser 


Ser 


Val 


205 








Thr 


He 


Thr 


Phe 


Tyr 


Phe 


Ala 


Tyr 








240 


His 


Leu 


Gin 


Lys 






255 




Arg 


Lys 


Asp 


Val 




270 






val 


Thr 


He 


Thr 


285 








His 


Phe 


Arg 


Asn 


Gly 


Glu 


Thr 


Asn 








320 


Leu 


His 


Glu 


Ala 






335 




Leu 


Tyr 


Tyr 


He 




350 






He 


Asn 


Gly 


Asn 


365 








Gin 


Trp 


Gin 


Ser 


Lys 


Gly 


Leu 


Leu 








400 


Tyr 


Cys 


Asp 


Tyr 






415 




Gly 


Cys 


Ser 


He 




430 






Ser 


Cys 


A3p 


Val 


445 








Leu 


Ser 


His 


He 


Val 


Glu 


Lys 


Ser 








480 


He 


Gly 


Val 


Gin 






495 




Asp 


Gin 


Gly 


Lys 
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500 






Tyr 


Lys 


Gly 


Leu Gin 


He 


Gly Thr 






515 






520 


Lys 


Phe 


Cys 


Val Gly 


Leu 


Leu Arg 




530 








535 


Glu 


Tyr 


Asn 


Pro Ala 


Leu 


Pro Ser 


545 








550 




Leu 


Asn 


Xaa 


lie Gin 


Ala 


Cys Lys 








565 






Ser 


Leu 


Asp 


Glu Asp 


Glu 


Pro Arg 








580 






Ala 


Glu 


Ser 


Asn Asp 


Glu 


Leu Asp 






595 






600 


Asp 


Tyr 


Glu 


Pro Ser 


Ala 


Gin Glu 




610 








615 


Ser 


Arg 


Thr 


Tyr Leu 


Pro 




625 








630 





505 










510 




Arg 


Glu 


Leu 


Glu 


Glu 


Met 


Gly Ala 










525 






Leu 


Lys Arg 


Leu 


Thr 


Ser 


Pro Leu 








540 








Pro 


Ala 


Leu 


Thr 


Phe 


Glu 


Asn Asp 






555 








560 


Val 


Thr 


Ser 


Pro 


Tyr 


Pro 


Leu Met 




570 










575 


Phe 


Leu 


Glu 


Glu 


Val 


Asp 


Tyr Ser 


585 










590 




lie 


Glu 


Leu 


Ala 


Glu 


Asn 


Val Gly 










605 






Glu 


Val 


Leu 


Ser 


Asp 


Ser 


Glu Leu 



620 



<210> 276 
<211> 812 
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<220> 

<221> misc_feature 
<222> (1) . . . (812) 

<223> Xaa = X or * as defined in Table 6 



<400> 276 



lie 


Lys 


Ala 


Leu 


Ser 


Ser 


Ser 


Ala 


Glu 


Asp 


Ala 


Ser 


Leu 


Val 


Asn 


Ala 


1 








5 










10 










15 




Ser 


lie 


Ser 


Ser 
20 


Ser 


Val 


Lys 


Ala 


Thr 
25 


Ser 


Pro 


Val 


Lys 


Ser 
30 


Thr 


Thr 


Ser 


He 


Thr 
35 


Asp 


Ala 


Lys 


Ser 


Cys 
40 


Glu 


Gly 


Gin 


Asn 


Pro 
45 


Glu 


Leu 


Leu 


Pro 


Lys 
50 


Thr 


Pro 


lie 


Ser 


Pro 
55 


Leu 


Lys 


Thr 


Gly 


Val 
60 


Ser 


Lys 


Pro 


He 


Val 


Lys 


Ser 


Thr 


Leu 


Ser 


Gin 


Thr 


Val 


Pro 


Ser 


Lys 


Gly 


Glu 


Leu 


Ser 


65 










70 










75 










80 


Arg 


Glu 


He 


Cys 


Leu 
85 


Gin 


Ser 


Gin 


Ser 


Lys 
90 


Asp 


Lys 


Ser 


Thr 


Thr 

95 


Pro 


Gly 


Gly 


Thr 


Gly 
100 


He 


Lys 


Pro 


Phe 


Leu 
105 


Glu 


Arg 


Phe 


Gly 


Glu 
110 


Arg 


Cys 


Gin 


Glu 


His 
115 


Ser 


Lys 


Glu 


Ser 


Pro 
120 


Ala 


Arg 


Ser 


Thr 


Pro 
125 


His 


Arg 


Thr 


Pro 


He 
130 


He 


Thr 


Pro 


Asn 


Thr 
135 


Lys 


Ala 


He 


Gin 


Glu 
140 


Arg 


Leu 


Phe 


Lys 


Gin 


Asp 


Thr 


Ser 


Ser 


Ser 


Thr 


Thr 


His 


Leu 


Ala 


Gin 


Gin 


Leu 


Lys 


Gin 


145 










150 










155 








160 


Glu 


Arg 


Gin 


Lys 


Glu 


Leu 


Ala 


Cys 


Leu 


Arg 


Gly 


Arg 


Phe 


Asp 


Lys Gly 










165 










170 










175 




Asn 


He 


Trp 


Ser 
180 


Ala 


Glu 


Lys 


Gly 


Gly 
185 


Asn 


Ser 


Lys 


Ser 


Lys 
190 


Gin 


Leu 


Glu 


Thr 


Lys 
195 


Gin 


Glu 


Thr 


His 


Cys 
200 


Gin 


Ser 


Thr 


Pro 


Leu 
205 


Lys 


Lys 


His 


Gin 


Gly 


Val 


Ser 


Lys 


Thr 


Gin 


Ser 


Leu 


Pro 


Val 


Thr 


Glu 


Lys 


Val 


Thr 




210 










215 










220 








Glu 


Asn 


Gin 


lie 


Pro 


Ala 


Lys 


Asn 


Ser 


Ser 


Thr 


Glu 


Pro 


Lys 


Glu 


Val 


225 










230 










235 








240 


He 


Arg 


Glu 


He 


Glu 
245 


Met 


Ser 


Val 


Asp 


Asp 
250 


Asp 


Asp 


He 


Asn 


Ser 
255 


Ser 
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Lys 


Val 


He 


Asn 
260 


Asp 


Leu 


Phe 


Ser 


Asp 


Met 


Glu 
275 


Lys 


Ser 


Gin 


Ala 


Gly 
280 


Ala 


Ala 
290 


Lys 


Asn 


Arg 


Lys 


Met 
295 


His 


Leu 


His 


His 


Trp 


His 


Lys 


Gin 


Leu 


305 










310 






Asp 


Trp 


Asn 


Xaa 


Lys 
325 


Thr 


Pro 


Ala 


Glu 


Asn 


Ser 


Lys 
340 


Glu 


Leu 


Val 


Ser 


Leu 


Gly 


Ser 
355 


Glu 


Asp 


Arg 


Asp 


Leu 
360 


Glu 


Thr 
370 


Glu 


Arg 


Pro 


Ser 


He 
375 


Lys 


Val 


Thr 


Ser 


Lys 


Leu 


Asp 


Glu 


Lys 


385 










390 






Asn 


He 


Lys 


Gin 


Lys 
405 


Met 


Gin 


Glu 


Gin 


Thr 


Val 


He 
420 


Tyr 


Gin 


Ala 


Ser 


Glu 


Glu 


His 
435 


Gly 


Lys 


Gly 


Ser 


Leu 
440 


Leu 


Leu 
450 


He 


Ala 


Thr 


Gly 


Lys 

2 

455 


Arcr 


Lys 


Leu 


Lys 


Asn 


Glu 


Gly 


Pro 


Gin 


4S5 










470 






Pro 


Ser 


Glu 


Phe 


He 
485 


Ala 


He 


Pro 


He 


Arg 


Leu 


Pro 
500 


Xaa 


Lys 


Ala 


Asp 


Pro 


Asp 


Ala 
515 


Ala 


Asn 


Tyr 

2 


Tyr 


Tyr 
520 


Glu 


Asn 
530 


Met 


Val 


Ala 


Thr 


Pro 

535 


Leu 


Gly 


Asp 


Ala 


Leu 


Thr 


Phe 


Thr 


Thr 


545 










550 






Asn 


Asp 


Phe 


Glu 


He 
565 


Asn 


He 


Glu 


Asp 


Pro 


Ser 


Gly 
580 


Leu 


Asp 


Lys 


Lys 


Ser 


Asn 


He 
595 


His 


Ser 


Ser 


Val 


Met 
600 


Val 


Arg 
610 


Thr 


Ser 


Asn 


Phe 


Ala 
615 


Leu 


Ser 


Ser 


Val 


Gly 


Asn 


Thr 


Lys 


Phe 


625 










630 






Ser 


Ser 


Leu 


Glu 


Gly 
645 


His 


He 


Tyr 


Ser 


Ser 


Val 


Glu 
660 


Glu 


Arg 


Gly 


Phe 


Leu 


Gin 


Pro 
675 


Lys 


Arg 


Gin 


Thr 


He 
680 


Ala 


Trp 
690 


His 


Arg 


Arg 


Trp 


Cys 
695 


Val 


Trp 


Thr 


Tyr 


Pro 


Asp 


Asp 


Glu 


Lys 

2 


705 










710 






Asn 


Leu 


Ala 


Asn 


Cys 
725 


Thr 


Ser 


Arg 


Phe 


Cys 


Ala 


Arg 
740 


Arg 


Asn 


Thr 


Phe 


Arg 


Glu 


Asp 
755 


Asp 


Arg 


Glu 


Thr 


Leu 
760 



Asp 


Val 


Leu 


Glu 


Glu 


Gly 


Glu 


Leu 


265 










270 






Asp 


Gly 


Ser 


Ser 


He 


Ser 


Arg 


Thr 










285 








Xaa 


He 


Ser 


Pro 


Gin 


Cys 


Leu 


Tyr 








300 










Val 


Xaa 


Val 


Xaa 


Cys 


Pro 


His 


Leu 






315 










320 


Glu 


Val 


Met 


Lys 


Val 


Gin 


Asn 


Gin 




330 










335 




Arg 


Arg 


Ala 


Glu 


Ser 


Gly 


Asp 


Ser 


345 










350 






Leu 


Tyr 


Arg 


Ser 


Gin 


Arg 


Phe 


Lys 










365 








Gin 


Val 


He 


Val 


Arg 


Lys 

2 


Glu 


Asp 








380 










Asn 


Asn 


Ala 


Phe 


Pro 


Cys 


Gin 


Val 






395 










400 


Leu 


Asn 


Asn 


Glu 


He 


Asn 


Met 


Gin 




410 










415 




Gin 


Ala 


Leu 


Asn 


Cys 


Cys 


Val 


Asp 


425 










430 






Glu 


Glu 


Ala 


Glu 


Ala 


Glu 


Arc? 


Leu 










445 








Thr 


Leu 


Leu 


He 


Asp 


Glu 


Leu 


Asn 








460 










Arcr 


Lys 


Asn 


Xaa 


Gly 


Xaa 


Ser 


Ala 






475 










480 


Lys 


Asp 


Gin 


Phe 


Thr 


Leu 


Ser 


Glu 




490 










495 




Phe 


Val 


Cys 

2 


Ser 


Thr 


Val 


Gin 


Lvs 


505 










510 






Leu 


He 


He 


Leu 


Lys 


Ser 


Arcr 


Ser 










525 








Ala 


Ser 


Thr 


Ser 


Asn 


Ser 


Leu 


Asn 








540 










Thr 


Phe 


Thr 


Leu 


Gin 


Asp 


Val 


Ser 






555 










560 


Val 


Tyr 


Ser 


Leu 


Val 


Gin 


Lys 


Lvs 




570 










575 




Lys 


Lys 


Thr 


Ser 


Lys 


Ser 


Lys 

2 


Lys 


585 










590 






Ala 


Ser 


Pro 


Gly 


Gly 


Leu 


Ser 


Ala 










605 








Val 


Gly 


Ser 


Tyr 


Thr 


Leu 


Ser 


Leu 








620 










Val 


Leu 


Asp 


Lys 


Val 


Pro 


Phe 


Leu 






635 










640 


Leu 


Lys 


He 


Lys 


Cys 


Gin 


Val 


Asn 




650 










655 




Leu 


Gly 


Cys 


Pro 


Gly 


Gly 


Gly 


Arg 


665 










670 






Phe 


Glu 


Asp 


Val 


Ser 


Gly 


Phe 


Gly 










685 








Leu 


Ser 


Gly 


Asn 


Cys 


He 


Ser 


Tyr 








700 










Arg 


Lys 


Asn 


Pro 


He 


Gly 


Arg 


He 






715 










720 


Gin 


He 


Glu 


Pro 


Ala 


Asn 


Arg 


Glu 




730 










735 




Glu 


Leu 


He 


Thr 


Val 


Arg 


Pro 


Gin 


745 










750 






Val 


Thr 


Asn 


Ala 


Gly 


Thr 


His 


Ser 



765 
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Val Phe Thr Lys Asn Trp Leu Ser 

770 775 

Leu Trp Met Gin Lys Leu Asn Gin 

785 790 

Gin Pro Asp Ala Cys Tyr Lys Pro 
805 



Ala Asp Thr Lys Glu Glu Arg Asp 
780 

Val Leu Cys Asp lie Arg Leu Trp 
795 800 
lie Gly Lys Pro 
810 
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<400> 277 



Met 


Gin 


lie 


Asn 


Glu 


Thr 


lie 


Trp 


Asp 


Thr 


Val 


Gly 


Ala 


Ala 


Ser 


Arq 


1 








5 










10 










15 




His 


Gly 


Glu 


Gly 


Glu 


Arg 


Gin 


Ala 


Lys 


Ser 


Ser 


Thr 


Arg 


Gly 


Cvs 


Thr 








20 










25 










30 






His 


Leu 


Ala 


Glu 


Gly 


Gin 


Gly 


lie 


Tyr 


Leu 


'Gin 


Glu 


Glu 


Gin 


Ser 


Pro 






35 










40 










45 








Pro 


Glu 


Met 


Cys 


Thr 


Arg 


Val 


Met 


Glu 


Lys 


Arq 


Glu 


Gly 


Leu 


Thr 


He 




50 










55 










60 










Glu 


Arg 


Glu 


Arg 


Asp 


Pro 


Leu 


Leu 


Pro 


Val 


Trp 


Lys 


Ala 


Leu 


Gly 


He 


65 










70 










75 










80 


Gin 


Ala 


His 


Lys 


Cys 


Val 


Ala 


His 


Thr 


Thr 


Asn 


Pro 


Ser 


Lys 


Ala 


Thr 










85 










90 










95 




Ala 


Val 


His 


Leu 


Pro 


His 


Leu 


Thr 


Met 


Gin 


Pro 


Gin 


Gly 


Cys 


Leu 


Met 








100 










105 










110 






Ser 


Phe 


Phe 


Pro 


Thr 


Ala 


Ala 


Glu 


Phe 


Ser 


Thr 


Tyr 


Gly 


Gin 


Glu 


Leu 






115 










12 0 










125 








Tyr 


Leu 


Glu 


Asn 


Asn 


Gin 


lie 


Glu 


Glu 


He 


Thr 


Glu 


He 


Cys 


Phe 


Asn 




130 










135 










140 










His 


Thr 


Arg 


Lys 


He 


Asn 


Val 


lie 


Val 


Leu 


Arg 


Tyr 


Asn 


Lys 


He 


Glu 


145 










150 










155 










160 


Glu 


Asn 


Arg 


He 


Ala 


Pro 


Leu 


Ala 


Trp 


He 


Asn 


Gin 


Glu 


Asn 


Leu 


Glu 










165 










170 










175 




Ser 


lie 


Asp 


Leu 


Ser 


Tyr 


Asn 


Lys 


Leu 


Tyr 


His 


Val 


Pro 


Ser 


Tyr 


Leu 








180 










185 










190 






Pro 


Lys 


Ser 


Leu 


Leu 


His 


Leu 


Val 


Leu 


Leu 


Gly 


Asn 


Gin 


He 


Glu 


Arg 






195 










200 










205 








lie 


Pro 


Gly 


Tyr 


Val 


Phe 


Gly 


His 


Met 


Glu 


Pro 


Gly 


Leu 


Glu 


Tyr 


Leu 




210 










215 










220 










Tyr 


Leu 


Ser 


Phe 


Asn 


Lys 


Leu 


Ala 


Asp 


Asp 


Gly 


Met 


Asp 


Arg 


Val 


Ser 


225 










230 










235 










240 


Phe 


Tyr 


Gly 


Ala 


Tyr 


His 


Ser 


Leu 


Arg 


Glu 


Leu 


Phe 


Leu 


Asp 


His 


Asn 










245 










250 










255 




Asp 


Leu 


Lys 


Ser 


He 


Pro 


Pro 


Gly 


He 


Gin 


Glu 


Met 


Lys 


Ala 


Leu 


His 








260 










265 










270 






Phe 


Leu 


Arg 


Leu 


Asn 


Asn 


Asn 


Lys 


He 


Arg 


Gly 


Asn 


Lys 


Gin 


Glu 


He 






275 










280 










285 








Lys 


Gin 


Thr 


Ser 


Lys 


Gin 


Ala 


Ser 


Ala 


Val 


Gin 


Ser 


Glu 


Lys 


Trp 


Val 




290 










295 










300 










Thr 


Met 


Arg 


Arg 


Ala 


His 


Trp 


Gly 


Leu 


Arg 


Ala 


Ala 


Arg 


Arg 


Leu 


Arg 


305 










310 










315 










320 


Pro 


Pro 


Ser 


Thr 


Ala 


Trp 


lie 


Asn 


Ser 


Arg 


Ser 


Arg 


Pro 


Val 


Pro 


Val 










325 










330 










335 




Glu 


Gin 


Thr 


His 


Cys 


Gly 


Leu 


Ala 


Val 


Ala 


Glu 


Glu 


Arg 


Lys 


Asp 


Leu 








340 










345 










350 






Phe 


Met 


Phe 


Phe 


Arg 


Ser 


Leu 


His 


Phe 


Phe 


Val 


Glu 


Trp 


Phe 


Glu 


Tyr 






355 










360 










365 








Arg 


Lys 


Arg 


Thr 


Phe 


Lys 


His 


Leu 


Lys 


Trp 


Asp 


Glu 


Asp 


Tyr 


Asp 


Gin 



370 375 380 
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Glu 


Pro 


Asp 


Asp 


Asp 


Tyr 


Gin 


Thr 


Gly 


Phe Pro 


Phe 


Arg 


Gin 


Asn 


Val 


385 










390 








395 










400 


Asp 


Tyr 


Gly 


Val 


Pro 


Phe 


His 


Gin 


Tyr 


Thr Leu 


Gly 


Cys 


Val 


Ser 


Glu 










405 










410 








415 




Cys 


Phe 


Cys 


Pro 


Thr 


Asn 


Phe 


Pro 


Ser 


Ser Met 


Tyr 


Cys 


Asp 


Asn 


Arg 








420 










425 








430 






Lys 


Leu 


Lys 


Thr 


He 


Pro 


Asn 


He 


Pro 


Met His 


He 


Gin 


Gin 


Leu 


Tyr 






435 










440 








445 








Leu 


Gin 


Phe 


Asn 


Glu 


He 


Glu 


Ala 


Val 


Thr Ala 


Asn 


Ser 


Phe 


He 


Asn 




450 










455 








460 










Ala 


Thr 


His 


Leu 


Lys 


Glu 


lie 


Asn 


Leu 


Ser His 


Asn 


Lys 


He 


Lys 


Ser 


465 










470 








475 










480 


Gin 


Lys 


lie 


Asp 


Tyr 


Gly 


Val 


Phe 


Ala 


Lys Leu 


Pro 


Asn 


Leu 


Leu 


Gin 










485 










490 








495 




Leu 


His 


Leu 


Glu 


His 


Asn 


Asn 


Leu 


Glu 


Glu Phe 


Pro 


Phe 


Pro 


Leu 


Pro 








500 










505 








510 






Lys 


Ser 


Leu 


Glu 


Arg 


Leu 


Leu 


Leu 


Gly 


Tyr Asn 


Glu 


He 


Ser 


Lys 


Leu 






515 










520 








525 








Gin 


Thr 


Asn 


Ala 


Met 


Asp 


Gly 


Leu 


Val 


Asn Leu 


Thr 


Met 


Leu Asp 


Leu 




530 










535 








540 










Cys 


Tyr 


Asn 


Tyr 


Leu 


His 


Asp 


Ser 


Leu 


Leu Lys 


Asp 


Lys 


He 


Phe 


Ala 


545 










550 








555 










560 


Lys 


Met 


Glu 


Lys 


Leu 


Met 


Gin 


Leu 


Asn 


Leu Cys 


Ser 


Asn 


Arg 


Leu 


Glu 










565 










570 








575 




Ser 


Met 


Pro 


Pro 


Gly 


Leu 


Pro 


Ser 


Ser 


Leu Met 


Tyr 


Leu 


Ser 


Leu 


Glu 








580 










585 








590 






Asn 


Asn 


Ser 


He 


Ser 


Ser 


He 


Pro 


Glu 


Lys Tyr 


Phe 


Asp 


Lys 


Leu 


Pro 






595 










600 








605 








Lys 


Leu 


His 


Thr 


Leu 


Arg 


Met 


Ser 


His 


Asn Lys 


Leu 


Gin 


Asp 


He 


Pro 




610 










615 








620 










Tyr 


Asn 


lie 


Phe 


Asn 


Leu 


Pro 


Asn 


He 


Val Glu 


Leu 


Ser 


Val Gly 


His 


625 










63 0 








635 










640 


Asn 


Lys 


Leu 


Lys 


Gin 


Ala 


Phe 


Tyr 


He 


Pro Arg 


Asn 


Leu 


Glu 


His 


Leu 










645 










650 








655 




Tyr 


Leu 


Gin 


Asn 


Asn 


Glu 


He 


Glu 


Lys 


Met Asn 


Leu 


Thr 


Val 


Met 


Cys 








660 










665 








670 






Pro 


Ser 


He 


Asp 


Pro 


Leu 


His 


Tyr 


His 


His Leu 


Thr 


Tyr 


He Arg 


Val 






675 










680 








685 








Asp 


Gin 


Asn 


Lys 


Leu 


Lys 


Glu 


Pro 


He 


Ser Ser 


Tyr 


He 


Phe 


Phe 


Cys 




690 










695 








700 










Phe 


Pro 


His 


He 


His 


Thr 


lie 


Tyr 


Tyr 


Gly Glu 


Gin 


Arg 


Ser 


Thr 


Asn 


705 










710 








715 










720 


Gly 


Gin 


Thr 


He 


Gin 


Leu 


Lys 


Thr 


Gin 


Val Phe 


Arg 


Arg 


Phe 


Pro 


Asp 










725 










730 








735 




Asp 


Asp 


Asp 


Glu 


Ser 


Glu 


Asp 


His 


Asp 


Asp Pro 


Asp 


Asn 


Ala 


His 


Glu 








740 










745 








750 






Ser 


Pro 


Glu 


Gin 


Glu 


Gly 


Ala 


Glu 


Gly 


His Phe 


Asp 


Leu 


His 


Tyr 


Tyr 



755 760 765 



Glu Asn Gin Glu 
770 



<210> 278 
<211> 65 
<212> PRT 

<213> Homo sapiens 



. <400> 278 

Ser Arg Arg Arg Gly Gly Val Ser Ala Pro Thr Ser Phe Tyr Gly Arg 

15 10 15 

Asp Arg Arg Met Phe Pro Ala Gin Glu Glu Ala Asp Arg Thr Val Phe 
20 25 30 
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Val Gly Asn Leu Glu Ala Arg Val Arg Glu Glu lie Leu Tyr Glu Leu 

35 40 45 

Phe Leu Gin Val Leu Cys Pro Arg Glu Met Gly lie Leu Ser lie Ser 
50 55 "* 60 

Pro 
65 



<210> 279 
<211> 294 
<212> PRT 
<213> Homo sapiens 



<400> 279 



Met 


Ser 


Arg 


Trp Gly 


Ala 


Ala 


Val 


1 






5 








His 


Phe 


Ala 


His Ala 


His 


lie 


Thr 








20 








Gly 


Arg 


Arcr 
-> 


Lys Arcr 


Arcr 


Ser 


Ser 






35 








40 


Ala 


Asn 


Ala 


Gin Ala 


His 


Phe 


Leu 




50 








55 




Gly 


Pro 


Gin 


Asn Arg 


Ala 


Glv 


Ala 


65 








70 






Ser 


Phe 


Pro 


Lys Tyr 


Lvs 


Pro 


Ser 








85 








Leu 


Asp 


Pro 


Ala Glu 


Tyr 


Asn 


lie 








100 








Ala 


Glu 


Arg 


Leu Ala 


His 


Arg 


Ala 






115 








120 


Gin 


Tyr 


Asn 


Asp Pro 


Asn 


Arg 


Arg 




130 








135 




Leu 


Arg 


Trp 


Ala Tyr 


Ala 


Arg 


Thr 


145 








150 






Pro 


Thr 


Pro 


Lys Ser 


Ser 


Leu 


Met 








165 








Pro 


Leu 


lie 


Phe lie 


Tyr 


Tyr 


lie 








180 








Asn 


Gin 


Arg 


Trp Leu 


Thr 


Asp 


Ser 






195 








200 


Leu 


Leu 


Glu 


Arg Asp 


Asp 


Leu 


Thr 




210 








215 




Pro 


Ala 


Ala 


Phe Leu 


Arg 


Gly 


Asn 


225 








230 






Lys 


Cys 


Leu 


Asp Leu 


lie 


Ser 


Tyr 








245 








Ser 


Lys 


Thr 


Pro Phe 


Gin 


Thr 


Gly 








260 








Ser 


Leu 


Val 


lie Gly 


Gly 


Lys 


Gly 






275 








280 


Gly 


Glu 


Thr 


Leu Thr 


Lys 







290 



Gly Gin Gly 


Ala 


Leu 


Arcr 


Glu 


Glu 


10 










15 




Glu Arg Thr 


Arcr 


Arcr 

ZD 


Val 


Arcr 


Glu 


25 








30 






Leu Leu 


Thr 


Thr 


Ser 


Pro 


Thr 


Ser 








45 








Lys Leu 


Lys 


Val 


Ser 


He 


Asp 


Lys 






60 










He Val 


Pro 


Trp 


Phe 


Ala 


Lys 


Met 




75 










80 


Ser Leu Arg 


Thr 


Leu 


Pro 


Glu 


Thr 


90 










95 




Ser Pro 


Glu 


Thr 


Arg 


Arg 


Ala 


Gin 


105 








110 






Gin Leu Lys 


Arg 


Glu 


Tyr 


Leu 


Leu 








125 








Gly Leu 


He 


Glu 


Asn 


Pro 


Ala 


Leu 






140 










He Asn 


Val 


Tyr 


Pro 


Asn 


Phe 


Lys 




155 










160 


Gly Ala 


Phe 


Val 


Trp 


Asp 


Phe 


Gly 


170 










175 




He Lys 


Thr 


Glu 


Arg 


Trp 


Asp 


Pro 


185 








190 






Arg He 


Leu 


Lys 


Tyr 


Glu 


Ala 


He 








205 








Leu Thr 


Thr 


Asp 


Asn 


Ser 


Leu 


Asn 






220 










Pro Asn 


Pro 


Glu 


Glu 


Pro 


Glu 


His 




235 










240 


Gin Thr 


Arg 


Val 


Arg 


Leu 


Asp 


Leu 


250 










255 




Arg His Leu 


Phe 


He 


Asp 


Gly 


Ser 


265 








270 






His Asn Gly 


Tyr 


Ser 


Val 


Val 


Asp 



285 



<210> 280 
<211> 198 
<212> PRT 
<213> Homo sapiens 



<400> 280 
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Met 


Gin 


Thr 


Phe 


Thr 


Thr 


Cys 


He 


Ser 


1 








5 










Leu 


Leu 


Ala 


Asn 


Ala 


Ser 


Ser 


His 


Gly 








20 










25 


Val 


Gly 


He 


Cys 


Leu 


Leu 


Met 


Val 


Pro 






35 










40 




Gly 


Ser 


Ala 


Arg 


Gly 


Ala 


Thr 


Lys 


Val 




50 










55 






Cys 


Gin 


Asn 


Glu 


His 


Leu 


Gly 


Glu 


Leu 


65 










70 








Asn 


Gin 


Leu 


Asn 


He 


Arg 


Glu 


Asn 


Gly 










85 










Gin 


Glu 


Leu 


Glu 


Glu 


Ser 


Val 


Ala 


Glu 








100 










105 


Arg 


Asp 


Leu 


Glu 


Lys 


Met 


Met 


Ser 


Lys 






115 










120 




Leu 


Ser 


lie 


Ser 


Asp 


Leu 


Val 


Asn 


Phe 




130 










135 






Leu 


Val 


Lys 


Thr 


Ala 


Glu 


Gly 


He 


Val 


145 










150 








Phe 


Pro 


Ala 


Leu 


His 


Phe 


His 


Ala 


Tyr 










165 










Ser 


Phe 


Lys 


His 


Tyr 


He 


Tyr 


Met 


lie 








180 










185 


He 


Pro 


Asp 


Phe 


Leu 


Thr 












195 
















<210> 281 














<211> 1352 














<212> PRT 














<213> Homo 


sapiens 










<400> 281 












Met 


Pro 


Val 


Pro 


Ser 


Arg 


His 


He 


Asn 


1 








5 










Asp 


Ala 


Ala 


Gly 


Trp 


Tyr 


Glu 


Gly 


Pro 








20 










25 


Arg 


Pro 


Leu 


Gly 


Arg 


Arg 


Ser 


Ser 


Leu 






35 










40 




Thr 


Trp 


Phe 


Glu 


Pro 


Asn 


His 


Arg 


Pro 




50 










55 






Ala 


Ala 


Glu 


Pro 


Tyr 


Leu 


Tyr 


Arg 


Glu 


65 










70 








Ala 


Arg 


Lys 


Gly 


Ser 


Thr 


Pro 


Asp 


Phe 










85 










Ala 


Val 


Met 


Ser 


Gly 


Arg 


Ser 


Pro 


Leu 








100 










105 


Asp 


Pro 


Ser 


Gly 


Ala 


Ala 


Arg 


Val 


Pro 






115 










120 




Asp 


Pro 


Gly 


Val 


Ser 


Arg 


Pro 


Val 


Pro 




130 










13 5 






Arg 


Thr 


Ser 


Trp 


Asp 


Pro 


Met 


Gin 


Gly 


145 










150 








Ala 


Gly 


His 


Leu 


Tyr 


Arg 


Asp 


Pro 


Gly 










165 










Arg 


Gin 


Thr 


Gin 


Ser 


Arg 


Ala 


Ala 


Ser 








180 










185 


Gin 


Pro 


Asp 


Thr 


Arg 


Tyr 


Gly 


Ala 


Glu 






195 










200 




Gin 


Val 


Phe 


Ser 


Asp 


He 


Ser 


Glu 


Arg 




210 










215 
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Tyr 


Ser 


Glu 


Tyr 


Ser 


Cys 


Met 


10 










15 




Thr 


Leu 


Tyr 


Cys 


Lys 


Leu 


Arg 










30 






Ala 


Val 


Lys 


Asn 


Gin 


Ala 


Ser 








45 








Arg 


Arg 


Lys 


Cys 


Gin 


Ala 


Gly 






60 










Asp 


Asp 


Gly 


Thr 


Asp 


Gly 


Lys 




75 










80 


Gly 


Arg 


Gly 


Gin 


Asn 


Cys 


Glu 


90 










95 




Lys 


Asp 


Leu 


Ser 


Gin 


Thr 


Ser 










110 






His 


He 


Phe 


Leu 


Lys 


Pro 


Met 








125 








Leu 


Met 


Gin 


Val 


Ser 


Lys 


Val 






140 










Leu 


Gin 


Gin 


Leu 


Pro 


Leu 


Ala 




155 










160 


Gly 


Asn 


Leu 


Phe 


Pro 


Val 


Cys 


170 










175 




Asp 


His 


Pro 


He 


Phe 


He 


Ser 










190 






He 


Gly 


Arcr 


Ser 


Gin 


Ser 


Trp 

ST 


10 










15 




Trp 


Glu 


Asn 


Ala 


Glu 


Ser 


Leu 










30 






Thr 


Tyr 


Gly 


Thr 


Ala 


Glu 


Gly 








45 








Gin 


Asp 


Ala 


Ala 


Leu 


Pro 


Val 






60 










Ala 


Val 


Tyr 


Asn 


Ser 


Val 


Ala 




75 










80 


Thr 


Phe 


Tyr 


Asp 


Ser 


Arg 


Gin 


90 










95 




Leu 


Pro 


Arg 


Glu 


Tyr 


Tyr 


Ser 










110 






Lys 


Glu 


Pro 


Pro 


Leu 


Tyr 


Arg 








125 








Ser 


Tyr 


Gly 


Val 


Leu 


Gly 


Ser 






140 










Arg 


Ser 


Pro 


Ala 


Leu 


Gin 


Asp 




155 










160 


Gly 


Lys 


Met 


He 


Pro 


Gin 


Gly 


170 










175 




Pro 


Gly 


Arg 


Tyr 


Gly 


Arg 


Glu 










190 






Val 


Pro 


Ala 


Tyr 


Pro 


Leu 


Ser 








205 








Pro 


He 


Asp 


Pro 


Ala 


Pro 


Ala 



220 
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Arg 


Gin 


Val 


Ala 


Pro 


Thr 


Cys 


Leu 


Val 


Val 


Asp 


Pro 


Ser 


Ser 


Ala 


Ala 


225 










230 










235 










240 


Ala 


Pro 


Glu 


Gly 


Ser 


Thr 


Gly 


Val 


Ala 


Pro 


Gly 


Ala 


Leu 


Asn 


Arg 


Gly 










245 










250 










255 




Tyr 


Gly 


Pro 


Ala 


Arg 


Glu 


Ser 


He 


Pro 


Ser 


Lys 


Met 


Ala 


Tyr 


Glu 


Thr 








260 










265 










270 






Tyr 


Glu 


Ala 


Asp 


Leu 


Ser 


Thr 


Phe 


Gin 


Gly 


Pro 


Gly 


Gly 


Lys 


Arg 


Thr 






275 










280 










285 








Val 


Leu 


Pro 


Glu 


Phe 


Leu 


Ala 


Phe 


Leu 


Arg 


Ala 


Glu 


Gly 


lieu 


Ala 


Glu 




290 










295 










300 










Ala 


Thr 


Leu 


Gly 


Ala 


Leu 


Leu 


Gin 


Gin 


Gly 


Phe 


Asp 


Ser 


Pro 


Ala 


Val 


305 










310 










315 










320 


Leu 


Ala 


Thr 


Leu 


Glu 


Asp 


Ala 


Asp 


He 


Lys 


Ser 


Val 


Ala 


Pro 


Asn 


Leu 










325 










330 










335 




Gly 


Gin 


Ala 


Arg 


Val 


Leu 


Ser 


Arg 


Leu 


Ala 


Asn 


Ser 


Cys 


Arg 


Thr 


Glu 








340 










345 










350 






Met 


Gin 


Leu 


Arg 


Arg 


Gin 


Asp 


Arg 


Gly 


Gly 


Pro 


Leu 


Pro 


Arg 


Ala 


Arg 






355 










360 










365 








Ser 


Ser 


Ser 


Phe 


Ser 


His 


Arg 


Ser 


Glu 


Leu 


Leu 


His 


Gly 


Asp 


Leu 


Ala 




370 










375 










380 










Ser 


Leu 


Gly 


Ala 


Ala 


Ala 


Pro 


Leu 


Gin 


Thr 


Ala 


Ser 


Pro 


Arg 


Ala 


Gly 


385 










390 










395 










400 


Asp 


Pro 


Ala 


Arg 


Arg 


Pro 


Ser 


Ser 


Ala 


Pro 


Ser 


Gin 


His 


Leu 


Leu 


Glu 










405 










410 










415 




Thr 


Ala 


Ala 


Thr 


Tyr 


Ser 


Ala 


Pro 


Gly 


Val 


Gly 


Thr 


His 


Ala 


Pro 


His 








420 










425 










430 






Phe 


Pro 


Ser 


Asn 


Ser 


Gly 


Tyr 


Ser 


Ser 


Pro 


Thr 


Pro 


Cys 


Ala 


Leu 


Thr 






435 










440 










445 








Ala 


Arg 


Leu 


Ser 


Pro 


Thr 


Tyr 


Pro 


Leu 


Gin 


Ala 


Gly 


Val 


Ala 


Leu 


Thr 




450 










455 










460 










Asn 


Pro 


Gly 


Pro 


Ser 


Asn 


Pro 


Leu 


His 


Pro 


Gly 


Pro 


Arcr 


Thr 


Ala 


Tyr 


465 










470 










475 










480 


Ser 


Thr 


Ala 


Tyr 


Thr 


Val 


Pro 


Met 


Glu 


Leu 


Leu 


Lys 


Arcr 


Glu 


Arcr 


Asn 










485 










490 










495 




Val 


Ala 


Ala 


Ser 


Pro 


Leu 


Pro 


Ser 


Pro 


His 


Gly 


Ser 


Pro 


Gin 


Val 


Leu 








500 










505 










510 






Arg 


Lys 


Pro 


Gly 


Ala 


Pro 


Leu 


Gly 


Pro 


Ser 


Thr 


Leu 


Pro 


Pro 


Ala 


Ser 






515 










520 










525 








Gin 


Ser 


Leu 


His 


Thr 


Pro 


His 


Ser 


Pro 


Tyr 


Gin 


Lys 


Val 


Ala 


Arg 


Arg 




530 










535 










540 










Thr 


Gly 


Ala 


Pro 


lie 


He 


Val 


Ser 


Thr 


Met 


Leu 


Ala 


Pro 


Glu 


Pro 


He 


545 










550 










555 










560 


Gin 


Phe 


Ala 


Gly 


Gin 


Ala 


Val 


Gin 


Ser 


Asp 


Asn 


Val 


Arg 


Lys 


Ala 


Tyr 










565 










570 










575 




Ala 


Ala 


Gly 


Thr 


Pro 


Val 


Arg 


Pro 


Thr 


Ser 


Pro 


Gly 


Asp 


Thr 


Asp 


Lys 








580 










585 










590 






Trp 


Gly 


Leu 


Gin 


Ala 


Arg 


Ala 


Pro 


Gly 


Arg 


Ala 


Val 


Asp 


Pro 


Arg 


Asn 






595 










600 










605 








Met 


lie 


Ser 


Ala 


Gin 


Glu 


His 


Lys 


Val 


Val 


Glu 


Cys 


Met 


Ala 


Arg 


Arg 




610 










615 










620 










Ser 


Ala 


Thr 


Cys 


Phe 


Val 


Phe 


Gly 


Gin 


Leu 


Cys 


Arg 


Leu 


His 


Ser 


Thr 


625 










630 










635 










640 


Ser 


Ser 


Asp 


Pro 


Val 


Gly 


Val 


Asp 


Phe 


He 


Leu 


Ser 


Met 


Glu 


Asp 


Val 










645 










650 










655 




Gly 


Arg 


Gly 


Lys 


Ser 


Arg 


Asn 


Pro 


Asp 


Ser 


Trp 


Ser 


Pro 


Asn 


Ala 


Val 








660 










665 










670 






Val 


Trp 


Asp 


Ala 


Ser 


Gly 


Val 


Gly 


Gly 


Glu 


Arg 


Val 


Leu 


Gin 


Tyr 


Gin 






675 










680 










685 








Leu 


Asp 


Met 


Asn 


Thr 


Val 


Pro 


Pro 


Gin 


Gly 


Trp 


Thr 


Thr 


Arg 


Lys 


Thr 




690 










695 










700 










Arg 


Val 


Cys 


Cys 


Lys 


His 


Glu 


Ala 


Ser 


Pro 


Ser 


Pro 


He 


Ser 


Ala 


Leu 


705 










710 










715 










720 


Ala 


Ala 


He 


Ala 


Lys 


Glu 


Glu 


Gly 


Val 


He 


Leu 


Leu 


Leu 


Trp 


Thr 


Phe 










725 










730 










735 
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Thr 


Leu 


Gly 


Asn 


Lys 


Arg 


Leu 


Gly 


Gly 


Ser 


Ala 


Thr 


Arg 


Val 


Gly 


Tyr 








740 










745 










750 






Ala 


Glu 


Ala 


Gin 


Ala 


Glu 


Ala 


Pro 


Ser 


Cys 


Lys 


Ala 


Thr 


Thr 


Val 


Thr 






755 










760 










765 








Leu 


Ser 


Ser 


Gly 


Ser 


Ser 


His 


Glu 


Cys 


Asp 


Ser 


Ser 


Val 


Ser 


Ser 


Lys 




770 










775 










780 








Thr 


Ala 


Thr 


Cys 


Arg 


Asp 


Phe 


Met 


Gly 


Gin 


Pro 


Trp 


Gly 


His 


Ala 


Ser 


785 










790 










795 










800 


lie 


Pro 


Pro 


Thr 


Pro 


Asn 


Pro 


Pro 


Pro 


Pro 


Ala 


Val 


Val 


Pro 


Gly 


He 










805 










810 










815 




Phe 


Ser 


Gin 


His 


Glu 


Asn 


Pro 


Leu 


Ala 


Phe 


Leu 


Phe 


Ser 


Arg 


Leu 


Ala 








820 










825 










830 






Met 


Lys 


Asp 


Leu 


Leu 


Pro 


Gly 


Phe 


Glu 


Pro 


Gin 


Thr 


Leu 


Asp 


Arg 


Ser 






835 










840 










845 








Arg 


Ala 


Ser 


Leu 


Ser 


His 


Val 


Leu 


Arg 


Ala 


Arg 


Pro 


Ser 


Gly 


Arg 


Val 




850 










855 










860 








Glu 


Gly 


lie 


Arg 


Pro 


Gin 


He 


Met 


Asn 


Gly 


Pro 


Leu 


His 


Pro 


Arg 


Pro 


865 










870 










875 








880 


Leu 


Val 


Ala 


Leu 


Leu 


Asp 


Gly 


Arg 


Asp 


Cys 


Thr 


Val 


Glu 


Met 


Pro 


He 










885 










890 










895 




Leu 


Lys 


Asp 


Leu 


Ala 


Thr 


Val 


Ala 


Phe 


Cys 


Asp 


Ala 


Gin 


Ser 


Thr 


Gin 








900 










905 










910 






Glu 


lie 


His 


Glu 


Lys 


Val 


Leu 


Asn 


Glu 


Ala 


Val 


Gly 


Ala 


Met 


Met 


Tyr 






915 


, 








920 










925 






His 


Thr 


He 


Thr 


Leu 


Thr 


Arg 


Glu 


Asp 


Leu 


Glu 


Lys 


Phe 


Lys 


Ala 


Leu 




930 










935 










940 










Arg 


Val 


lie 


Val 


Arg 


He 


Gly 


Ser 


Gly 


Tyr 


Asp 


Asn 


Val 


Asp 


He 


Lys 


945 










950 










955 










960 


Ala 


Ala 


Gly 


Glu 


Leu 


Gly 


Glu 


Cys 


Glu 


Ala 


Ala 


Leu 


Ala 


Ala 


Trp 


Ser 










965 










970 










975 




Cys 


Pro 


Glu 


Leu 


Cys 


Gly 


Pro 


Cys 


Ser 


Gly 


Gly 


Leu 


Gly 


Glu 


Ala 


Ala 








980 










985 










990 






Gly 


Thr 


Gly 


Thr 


Thr 


Glu 


Gin 


Pro 


Leu 


Leu 


Ala 


Val 


Ala 


Arg 


Trp 


Leu 






995 








1000 








1005 






Pro 


Pro 


Gly 


Arg 


Ala 


Val 


Glu His 


Leu 


Ala 


Ala 


Leu 


Pro 


Ser 


His 


Asp 


1010 








1015 








1020 








Thr Gly 


lie 


Ala 


Val 


Cys 


Asn 


He 


Pro 


Ser 


Ala 


Ala 


Val 


Glu 


Glu 


Thr 


1025 








1030 








1035 








1040 


Ala Asp 


Ser 


Thr 


He 


Cys 


His 


He 


Leu 


Asn 


Leu 


Tyr Arg 


Arg 


Asn 


Thr 



1045 1050 1055 



Trp Leu Tyr Gin Ala Leu Arg Glu Gly Thr Arg Val Gin Ser Val Glu 

1060 1065 1070 

Gin He Arg Glu Val Ala Ser Gly Ala Ala Arg He Arg Gly Glu Thr 

1075 1080 1035 

Leu Gly Leu He Gly Phe Gly Arg Thr Gly Gin Ala Val Ala Val Arg 

1090 1095 1100 

Ala Lys Ala Phe Gly Phe Ser Val He Phe Tyr Asp Pro Tyr Leu Gin 
1105 1110 1115 " J 1120 

Asp Gly He Glu Arg Ser Leu Gly Val Gin Arg Val Tyr Thr Leu Gin 

1125 1130 1135 

Asp Leu Leu Tyr Gin Ser Asp Cys Val Ser Leu His Cys Asn Leu Asn 

1140 1145 1150 

Glu His Asn His His Leu He Asn Asp Phe Thr He Lys Gin Met Arg 

1155 1160 1165 

Ala Gly Ser He Pro Leu Trp Asn Ala Ala Arg Gly Gly Leu Val Asp 

1170 1175 H80 

Glu Lys Ala Leu Ala Gin Ala Leu Lys Glu Gly Arg He Arg Gly Ala 
H85 1190 H95 ~* 1200 

Ala Leu Asp Val His Glu Ser Glu Pro Phe Ser Phe Ala Gin Gly Pro 

1205 1210 1215 

Leu Lys Asp Ala Pro Asn Leu He Cys Thr Pro His Thr Ala Trp Tyr 

1220 1225 1230 

Ser Glu Gin Ala Ser Leu Glu Met Arg Glu Ala Ala Ala Thr Glu He 
1235 1240 1245 
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Arg Arg Ala He Thr Gly Arg He Pro Glu Ser Leu Arg Asn Cye Val 

1250 1255 1260 

Asn Lys Glu Phe Phe Val Thr Ser Ala Pro Trp Ser Val He Asp Gin 
1265 1270 1275 1280 

Gin Ala He His Pro Glu Leu Asn Gly Ala Thr Tyr Arg Tyr Pro Pro 

1285 1290 1295 

Gly He Val Gly Val Ala Pro Gly Gly Leu Pro Ala Ala Met Glu Gly 

1300 1305 1310 

He He Pro Gly Gly He Pro Val Thr His Asn Leu Pro Thr Val Ala 

1315 1320 1325 

His Pro Ser Gin Ala Pro Ser Pro Asn Gin Pro Thr Lys His Gly Asp 

1330 1335 1340 

Asn Arg Glu His Pro Asn Glu Gin 
1345 1350 



<210> 282 
<211> 181 
<212> PRT 

<213> Homo sapiens 



<400> 282 



Leu 


Leu 


Lys 


He 


Ser 


Gly He 


He 


Leu 


Lys 


Thr Gly Glu Ser Gin Asn 


1 








5 








10 


15 


Gin 


Leu 


Ala 


Val 


Asp 


Gin He 


Ala 


Phe 


Gin 


Lys Lys Leu Phe Gin Thr 








20 








25 




30 


Leu 


Arg 


Arg 


His 


Pro 


Ser Tyr 


Pro 


Lys 


He 


He Glu Glu Phe Val Ser 






35 








40 






45 


Gly 


Leu 


Glu 


Ser 


Tyr 


He Glu 


Asp 


Glu 


Asp 


Ser Phe Arg Asn Cys Leu 




50 








55 








60 


Leu 


Ser 


Cys 


Glu 


Arg 


Leu Gin 


Asp 


Glu 


Glu 


Ala Ser Met Gly Ala Ser 


65 










70 








75 80 


Tyr 


Ser 


Lys 


Ser 


Leu 


He Lys 


Leu 


Leu 


Leu 


Gly He Asp He Leu Gin 










85 








90 


95 


Pro 


Ala 


He 


He 


Lys 


Thr Leu 


Phe 


Glu 


Lys 


Leu Pro Glu Tyr Phe Phe 








100 








105 




110 


Glu 


Asn 


Lys 


Asn 


Ser 


Asp Glu 


He 


Asn 


He 


Pro Arg Leu He Val Ser 






115 








12 0 






125 


Gin 


Leu 


Lys 


Trp 


Leu 


Asp Arg 


Val 


Val 


Asp 


Gly Lys Asp Leu Thr Thr 




130 








135 








140 


Lys 


He 


Met 


Gin 


Leu 


He Ser 


He 


Ala 


Pro 


Glu Asn Leu Gin His Asp 


145 










150 








155 160 


He 


He 


Thr 


Ser 


Leu 


Pro Glu 


He 


Leu 


Gly 


Asp Ser Gin His Ala Asp 










165 








170 


175 


Val 


Gly 


Lys 


Glu 


Leu 
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<210> 283 
<211> 1385 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (1) . . . (1385) 

<223> Xaa = X or * as defined in Table 6 



<400> 283 

Ly3 Arg Lys Arg Arg Arg Thr Trp Lys Arg Tyr Arg Ser He He Asp 
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1 








5 




His 


Leu 


Gin 


Glu 


Lys 


Arg 








20 






Thr 


Leu 


Val 


Gin 


Pro 


Glu 






35 








Met 


Pro 


He 


Tyr 


Pro 


Thr 




50 










Phe 


Leu 


Arg 


Pro 


Asn 


He 


55 










70 


Tyr 


Leu 


Asp 


Thr 


His 


Phe 










85 




Leu Arg 


Glu 


Gly 


He 


Leu 








100 






Leu 


Arg 


Lys 


Arg 


Lys 


Phe 






115 








He 


He 


Thr 


Pro 


Met 


Cys 




130 










Phe 


Asp 


Thr 


Lys 


Pro 


Leu 


145 










150 


Leu 


Leu 


Tyr 


Gly 


Ser 


Leu 










165 




Phe 


Leu 


Phe 


Ala 


Thr 


Val 








180 






Gly lie 


Val 


Gin 


Leu 


Cys 






195 








Glu 


Val 


Gin 


Pro 


Ser 


Asp 




210 










Phe 


Glu 


Ala 


Tyr 


Arg 


His 


225 










230 


Glu Asp 


Val 


Pro 


Phe 


Gin 










245 




Lys 


Glu 


Pro 


Arg 


Tyr 


Leu 








260 






Leu 


He 


Glu 


Asn 


Pro 


Ser 






275 








Gly Leu 


Arg 


His 


Pro 


Arg 




290 










Ser 


Lys 


Glu 


Ala 


Leu 


Lys 


305 










310 


Phe 


Ala 


Leu 


Thr 


Arg 


Glu 










325 




Gly Lys 


Thr 


Tyr 


Val 


Gly 








340 






Glu 


Ser 


Val 


Trp 


Gin 


He 






355 








Cys 


Tyr 


Thr 


Asn 


His 


Ala 




370 










Leu 


Ser 


Gly 


Arg 


Pro 


Gly 


385 










390 


Val 


Glu 


He 


Pro 


Glu 


Gly 










405 




Arg Glu 


Phe 


Arg 


Arg 


Asn 








420 






Ser 


He 


Met 


Thr 


Gin 


Met 






435 








Ala 


Lys 


Thr 


Leu 


Glu 


Cys 




450 










Leu 


Gin 


Lys 


Tyr 


He 


Ser 


465 










470 


Pro 


Gin 


Cys 


Arg 


He 


Val 










485 




Ser 


His 


Asp 


Ala 


Gly 


Val 








500 






Lys Val 


Phe 


Leu 


Gin 


Gin 



10 



Arg 


Glu Val 


Thr 


Leu 


Arg 




25 








Ala 


Glu Asp 


His 


Val 


Glu 




40 








Tyr 


Asn Glu 


Val 


His 


Leu 


55 








60 


lie 


Ser Gly 


Lys 


Tyr 


Asp 








75 




Arg 


Leu Leu 


Arg 


Glu 


Glu 






90 






Glu 


Leu Leu 


Gin 


Ser 


Phe 




105 








Asp 


Asp He 


Arg 


He 


Tyr 




12 0 








Ser 


Ser Ser 


Gly 


He 


val 


135 








140 


Lys 


Phe Val 


Arg 


Trp 


Gin 








155 




Val 


Cys Met 


Ser 


Lys 


Asp 






170 






Ser 


Asn Arg 


Glu 


Gin 


Glu 




185 








Phe 


Asn Glu 


Gin 


Ser 


Gin 




200 








Ser 


Phe Leu 


Met 


val 


Glu 


215 








220 


Val 


Leu Glu 


Gly 


Leu 


Gin 








235 




Arg 


Asn He 


Val 


Glu 


Cys 






250 






Leu 


Met Gly 


Gly 


Arg 


Tyr 




265 








Ala 


Thr Gly 


Glu 


Phe 


Leu 




280 








He 


Asn Val 


Leu 


Asp 


Pro 


295 








300 


Leu 


Asp Asp 


Ser 


Gin 


Met 








315 




Leu 


Ala He 


He 


Gin 


Gly 






330 






Leu 


Lys He 


Val 


Gin 


Ala 




345 








Ser 


Leu Gin 


Lys 


Phe 


Pro 




360 








Leu 


Asp Gin 


Phe 


Leu 


Gly 


375 








380 


lie 


Val Arg 


Val 


Gly 


Trp 








395 




Ser 


Phe Thr 


Leu 


Arg 


Glu 






410 






Leu 


Pro Met 


His 


Leu 


Arg 




425 








Lys 


Glu Ser 


Glu 


Gin 


Glu 




440 








Thr 


Met Arg 


Gly 


Val 


Leu 


455 








460 


Pro 


Pro Ala 


Leu 


Gly 


Lys 








475 




Asn 


Gly Phe 


Ser 


Ser 


Gin 






490 






Ala 


Xaa Val 


Leu 


Val 


Ser 




505 








Asp 


Leu Arg 


He 


Gin 


Pro 
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15 




Val 


Asp 


Thr 


Tyr 




30 






Ser 


Tyr 


Ara 


Ser 


45 








Asp 


Glu 


Arg 


Pro 


Ser 


Thr 


Ala 


He 








80 


He 


Val 


Arg 


Pro 






95 




Glu 


Asp 


Gin 


Gly 




110 






Phe 


Asp 


Thr 


Ara 


125 








Tyr 


Lys 


Val 


Gin 


Asn 


Ser 


Lys 










160 


Asn 


Phe 


Glu 


Thr 






175 




Asp 


Leu 


Cys 


Ara 




190 






Gin 


Leu 


Leu 


Ala 


205 








Thr 


Thr 


Ala 


Txnr 


Glu 


Val 


Gin 


Glu 








240 


Asn 


Ser 


His 


Val 






255 




Asp 


Phe 


Thr 


Pro 




270 






Ara 


Asn 


Val 


Glu 


285 








Gly 


Gin 




Pro 


Glu 


Ala 


Leu 


Gin 








320 


Pro 


Pro 


Gly 


Thr 






335 




Leu 


Leu 


Thr 


Asn 




350 






He 


Leu 


Val 


Val 


365 








Ara 


His 


Leu 


Gin 


Lys 


Gly 


Ala 


Thr 








400 


Leu 


Arg 


Asn 


Lys 






415 




Arg 


Ala 


Tyr 


Met 




430 






Leu 


His 


Glu 


Gly 


445 








Arg 


Glu 


Gin 


Tvr 


Ser 


His 


Glu 


Trp 








480 


His 


Trp 


Lys 


His 






495 




Val 


Leu 


Ser 


Arg 




510 






Gin 


Ala 


Glu 


Gly 
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515 










520 


Asp 


Glu 


Glu 


Glu 


Glu 


Gly 


Glu 


Glu 




530 










535 




Ser 


Gin 


Arg 


Glu 


Ala 


Asp 


Pro 


Asp 


545 










550 






Arg 


Lys 


Arg 


Trp 


Xaa 


Gly 


Pro 


Ser 










565 








Gin 


Thr 


Arg 


Ser 


Trp 


Leu 


Lys 


Cys 








580 










Val 


Ala 


Leu 


Gly 


Gin 


Gin 


Leu 


Asp 






595 










600 


Arg 


Pro 


Ser 


Asp 


Asn 


Gin 


Lys 


Lys 




610 










615 




Glu 


Leu 


Arg 


Lys 


Leu 


Asn 


Thr 


Met 


625 










630 






Glu 


Asp 


Val 


Trp 


Gin 


Leu 


Asp 


Leu 










645 








Leu 


Trp 


Leu 


Gin 


Leu 


Tyr 


Gin 


Ala 








660 










Tyr 


Glu 


Arg 


Gin 


Tyr 


Arg 


Thr 


Ser 






675 










680 


Leu 


Gin 


Glu 


ASp 


Leu 


His 


He 


Leu 




690 










695 




Thr 


Thr 


Thr 


Gly 


Ala 


Ala 


Lys 


Tyr 


705 










710 






Pro 


Arg 


lie 


Val 


He 


Val 


Glu 


Glu 










725 








Thr 


He 


Ala 


Thr 


Leu 


Ser 


Lys 


Ala 








740 










Asp 


His 


Gin 


Gin 


Leu 


Arg 


Pro 


Ser 






755 










760 


Asn 


Phe 


Asn 


Leu 


Glu 


Val 


Ser 


Leu 




770 










775 




He 


Pro 


Phe 


Val 


Arg 


Leu 


Asn 


Tyr 


785 










790 






Ala 


Arg 


Leu 


Leu 


Thr 


Pro 


His 


He 










805 








Ser 


Val 


Leu 


Lys 


Tyr 


Glu 


Lys 


He 








820 










Phe 


Val 


Glu 


His 


Asn 


Phe 


Pro 


Glu 






835 










840 


His 


Gin 


Asn 


Gin 


His 


Glu 


Ala 


His 




850 










855 




Phe 


Leu 


Cys 


Gin 


Glu 


Tyr 


Leu 


Pro 


865 










870 






Tyr 


Thr 


Gly 


Gin 


Leu 


Phe 


Cys 


Leu 










885 








Phe 


Ala 


Gly 


Val 


Arg 


Val 


His 


Val 








900 










Asn 


Asp 


He 


He 


Leu 


Leu 


Ser 


Leu 






915 










920 


val 


Gly 


Phe 


Leu 


Gin 


lie 


Ser 


Asn 




930 










935 




Ala 


Lys 


Lys 


Gly 


Met 


Tyr 


Cys 


He 


945 










950 






Val 


Pro 


Leu 


Trp 


Ser 


Lys 


lie 


He 










965 








He 


Gly 


Pro 


Met 


Leu 


Arg 


Leu 


Cys 








980 










Thr 


Leu 


Val 


Ser 


Lys 


Ala 


Ser 


Asp 






995 








1000 


Cys 


Ser 


Leu 


Pro 


Cys 


Glu 


Phe 


Arg 



1010 1015 



Arg Ala Cys His Pro Tyr Asp Ser 



525 



Arg 


Glu 


Phe 


Arg 


Leu 


He 


Arg 


Asp 








540 










Phe 


Lys 


Gin 


Thr 


Gly 


Xaa 


Leu 


Arg 






555 










560 


Gly 


Gly 


Arg 


Arg 


Lys 


Arg 


val 


Glu 




570 










575 




Phe 


Trp 


Pro 


Xaa 


Gly 


Xaa 


Thr 


He 


585 










590 






Arg 


Ser 


Lys 


Pro 


Gin 


Glu 


Ser 


Gly 










605 








L Y S 


Met 


Lys 


Lys 


Arg 


Val 


Lys 


Asp 








620 










Thr 


Ala 


Ala 


Glu 


Ala 


Asn 


Glu 


He 






635 










640 


Ser 


Ser 


Arg 


Trp 


Gin 


Leu 


Tyr 


Arg 




650 










655 




Asp 


Thr 


Arg 


Arg 


Lys 


He 


Leu 


Ser 


665 










670 






Ala 


Glu 


Arg 


Met 


Ala 


Glu 


Leu 


Arg 










685 








Lys 


Asp 


Ala 


Gin 


Val 


Val 


Gly 


Met 








700 










Arg 


Gin 


He 


Leu 


Gin 


Lys 


Val 


Glu 






715 










720 


Ala 


Ala 


Glu 


Val 


Leu 


Glu 


Ala 


His 




730 






- 




735 




Cys 


Gin 


His 


Leu 


He 


Leu 


He 


Gly 


745 










750 






Ala 


Asn 


Val 


Tyr 


Asp 


Leu 


Ala 


Lys 










765 








Phe 


Glu 


Arg 


Leu 


Val 


Lys 


Val 


Asn 








780 










Gin 


His 


Arg 


Met 


Cys 


Pro 


Glu 


He 






795 










800 


Tyr 


Gin 


Asp 


Leu 


Glu 


Asn 


His 


Pro 




810 










815 




Lys 


Gly 


Val 


Ser 


Ser 


Asn 


Leu 


Phe 


825 










830 






Gin 


Glu 


Ser 


Lys 


Arg 


Arg 


Lys 


Ser 










845 








Asn 


Val 


Val 


Glu 


Leu 


Cys 


Lys 


Tyr 








860 










Ser 


Gin 


He 


Thr 


He 


Leu 


Thr 


Thr 






875 










880 


Arg 


Lys 


Leu 


Met 


Pro 


Ala 


Lys 


Thr 




890 










895 




Val 


Asp 


Lys 


Tyr 


Gin 


Gly 


Glu 


Glu 


905 










910 






Val 


Arg 


Ser 


Asn 


Gin 


Glu 


Gly 


Lys 










925 








Arg 


He 


Cys 


Val 


Ala 


Leu 


Ser 


Arg 








940 










Gly 


Asn 


Met 


Gin 


Met 


Leu 


Ala 


Lys 






955 










960 


His 


Thr 


Leu 


Arg 


Glu 


Asn 


Asn 


Gin 




970 










975 




Cys 


Gin 


Asn 


His 


Pro 


Glu 


Thr 


His 


985 










990 






Phe 


Gin 


Lys 


Val 


Pro 


Glu 


Gly 


Gly 








1005 








Leu 


Gly 


Cys 


Gly His 


Val 


Cys 


Thr 






1020 










Ser 


His 


Lys 


Glu 


Phe 


Gin 


Cys 


Met 
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1025 1030 1035 1040 

Lys Pro Cys Gin Lys Val lie Cys Gin Glu Gly His Arg Cys Pro Leu 

1045 1050 1055 

Val Cys Phe Gin Glu Cys Gin Pro Cys Gin Val Lys Val Pro Lys Thr 

1060 1065 1070 

lie Pro Arg Cys Gly His Glu Gin Met Val Pro Cys Ser Val Pro Glu 

1075 1080 1085 

Ser Asp Phe Cys Cys Gin Glu Pro Cys Ser Lys Ser Leu Arg Cys Gly 

1090 1095 1100 

His Arg Cys Ser His Pro Cys Gly Glu Asp Cys Val Gin Leu Cys Ser 
1105 1110 1115 1120 

Glu Met Val Thr He Lys Leu Lys Cys Gly His Ser Gin Pro Val Lys 

1125 1130 1135 

Cys Gly His Val Glu Gly Leu Leu Tyr Gly Gly Leu Leu Val Lys Cys 

1140 1145 1150 

Thr Thr Lys Cys Gly Thr He Leu Asp Cys Gly His Pro Cys Pro Gly 

1155 1160 1165 

Ser Cys His Ser Cys Phe Glu Gly Arg Phe His Glu Arg Cys Gin Gin 

1170 1175 1180 

Pro Cys Lys Arg Leu Leu He Cys Ser His Lys Cys Gin Lys Pro Cys 
1185 1190 1195 1200 

He Gly Glu Cys Pro Pro Cys Gin Arg Thr Cys Gin Asn Arg Cys Val 

1205 1210 1215 

His Ser Gin Cys Lys Lys Lys Cys Glu Glu Leu Cys Ser Pro Cys Val 

1220 1225 1230 

Glu Pro Met Cys Ser Arg Cys Gin His Tyr Gin Cys Thr Lys Leu Cys 

1235 1240 1245 

Ser Glu Pro Cys Asn Arg Pro Pro Cys Tyr Val Pro Cys Thr Lys Leu 

1250 1255 1260 

Leu Val Cys Gly His Pro Cys He Gly Leu Cys Gly Glu Pro Cys Pro 
1265 1270 1275 1280 

Lys Lys Cys Arg He Cys His Met Asp Glu Val Thr Gin He Phe Phe 

, 1285 1290 1295 

Gly Phe Glu Asp Glu Pro Asp Ala Arg Phe Val Gin Leu Glu Asp Cys 

1300 1305 1310 

Ser His He Phe Glu Val Gin Ala Leu Asp Arg Tyr Met Asn Glu Gin 

1315 1320 1325 

Lys Asp Asp Glu Val Ala He Arg Leu Lys Val Cys Pro He Cys Gin 

1330 1335 1340 

Val Pro He Arg Lys Asn Leu Ser Tyr Gly Thr Ser He Lys Gin Arg 
1345 1350 1355 1360 

Leu Glu Glu He Glu He He Glu Glu Lys Tyr Pro Gly Leu He Arg 

1365 1370 * 1375 

Gly Asn Gly Asn Gin Pro Gly Thr Ala 
1380 1385 



<210> 284 
<211> 552 
<212> PRT 

<213> Homo sapiens 



<400> 284 
Ala Glu Arg Lys Leu Ser Glu Lys 

1 5 
Pro Asp Asn Arg Asn Pro Ala Phe 
20 

Lys Val Leu Leu Ala Gly Val Ser 
35 40 
Val Asp Phe Cys He Ser Gly Lys 

50 55 
Val He He Leu Ala Asp Asp Met 



Ser Leu Val Val Ala Ala Val Ala 

10 15 
Thr Thr Met Gly Trp Leu Phe Leu 

25 30 
Phe Ser Gly Phe Leu Tyr Pro Leu 
45 

Thr Arg Gly Gin Lys Pro Asn Phe 
60 

Gly Trp Gly Asp Leu Gly Ala Asn 
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Trp 


Ala 


Glu 


Thr 


Gly 


Met 


Arg 


Phe 








1UU 


Ser 


Arg 


Ala 


Ser 






115 




Thr 


Arg 


Asn 


Phe 




130 






Thr 


Thr 


Leu 


Ala 


145 








lie 


Gly 


Lys 


Trp 


Arg 


Gly 


Phe 


Asp 








180 


Cys 


Thr 


Asp 


Thr 






195 




Gin 


Gly 


Asp 


Gly 




210 






Val 


Ala 


Leu 


Pro 


225 








Asn 


Leu 


Ser 


Ser 


lie 


Gin 


Arg 


Ala 








260 


Leu 


Ala 


His 


Met 






275 




Pro 


Arg 


Gly 


Arg 




290 






Ser 


Leu 


Val 


Gly 


305 








Asn 


Thr 


Phe 


Leu 


Cys 


Glu 


Leu 


Ala 








340 


Arg 


Gin 


Gly 


Gly 






355 




Arg 


Val 


Pro 


Ala 




370 






Thr 


Ser 


Thr 


Ala 


385 








Ala 


Leu 


Ala 


Gin 


Asp 


Val 


Ser 


Glu 








420 


Leu 


Phe 


His 


Pro 






435 




Thr 


Val 


Arg 


Leu 




450 






Arg 


Ala 


Cys 


Asp 


465 








Leu 


He 


Phe 


Asn 


Arg 


Gly 


Gly 


Ala 








500 


Leu 


Ala 


Asp 


Val 






515 




Asp 


Tyr 


Thr 


Gin 




530 






He 


Ala 


Cys 


Arg 



545 





70 






Lys 


Asp 


Thr 


Ala 


85 








Val 


Asp 


Phe 


His 


Leu 


Leu 


Thr 


Gly 








120 


Til _ 

Ala 


Val 


Thr 


Ser 






135 




Glu 


Val 


Leu 


Gin 




150 






His 


Leu 


Gly 


His 


165 








Tyr 


Tyr 


Phe 


Gly 


Pro 


Gly 


Tyr 


Asn 








200 


Pro 


Ser 


Arg 


Asn 






215 




Leu 


Tyr 


Glu 


Asn 




230 






Leu 


Ala 


Gin 


Lys 


245 








Ser 


Thr 


Ser 


Gly 


His 


Val 


Pro 


Leu 








280 


Lys 


Ser 


Leu 


Tyr 






295 




Gin 


He 


Lys 


Asp 




310 






Trp 


Phe 


Thr 


Gly 


325 








Gly 


Ser 


Val 


Gly 


Ser 


Pro 


Ala 


Lys 








360 


Leu 


Ala 


Tyr 


Trp 






375 




Leu 


Leu 


Ser 


Val 




390 






Ala 


Ser 


Leu 


Pro 


405 








Val 


Leu 


Phe 


Gly 


Asn 


Ser 


Gly 


Ala 








440 


Glu 


Arg 


Tyr 


Lys 






455 




Gly 


Ser 


Thr 


Gly 




470 






Leu 


Glu 


Asp 


Asp 


485 








Glu 


Tyr 


Gin 


Ala 


Leu 


Gin 


Asp 


He 








520 


Asp 


Pro 


Ser 


Val 






535 




Cys 


Gin 


Ala 


Ala 




550 











75 




Asn 


Leu 


Asp 


Lys 




90 






Ala 


Ala 


Ala 


Ser 


105 








Arg 


Leu 


Gly 


Leu 


Val 


Gly 


Gly 


Leu 








140 


Gin 


Ala 


Gly 


Tyr 






155 




His 


Gly 


Ser 


Tyr 




170 






He 


Pro 


Tyr 


Ser 


185 








His 


Pro 


Pro 


Cys 


Leu 


Gin 


Arg 


Asp 








220 


Leu 


Asn 


He 


Val 






235 




Tyr 


Ala 


Glu 


Lys 




250 






Arcr 


Pro 


Phe 


Leu 


265 








Pro 


Val 


Thr 


Gin 


Gly 


Ala 


Gly 


Leu 








300 


Lys 


Val 


Asp 


His 






315 




Asp 


Asn 


Gly 


Pro 




330 






Pro 


Phe 


Thr 


Gly 


345 








Gin 


Thr 


Thr 


Trp 


Pro 


Gly 


Arcr 


Val 








380 


Leu 


Asp 


He 


Phe 






395 




Gin 


Gly 


Arcr 


Arcr 




410 






Arg 


Ser 


Gin 


Pro 


425 








Ala 


Gly 


Asp 


Phe 


Ala 


Phe 


Tyr 


He 








460 


Pro 


Glu 


Leu 


Gin 






475 




Thr 


Ala 


Glu 


Ala 




490 






Val 


Leu 


Pro 


Glu 


505 








Ala 


Asn 


Asp 


Asn 


Thr 


Pro 


Cys 


Cys 



540 



80 

Met Ala Ser Glu 
95 

Thr Cys Ser Pro 
110 

Arg Asn Gly Val 
125 

Pro Leu Asn Glu 

Val Thr Gly He 
160 

His Pro Asn Phe 
175 

His Asp Met Gly 
190 

Pro Ala cys Pro 
205 

Cys Tyr Thr Asp 

Glu Gin Pro Val 
240 

Ala Thr Gin Phe 
255 

Leu Tyr Val Ala 
270 

Leu Pro Ala Ala 
285 

Trp Glu Met Asp 

Thr Val Lys Glu 
320 

Trp Ala Gin Lys 
335 

Phe Trp Gin Thr 
350 

Glu Gly Gly His 
365 

Pro Val Asn Val 

Pro Thr Val Val 
400 

Phe Asp Gly Val 
415 

Gly His Arg Val 
430 

Gly Ala Leu Gin 
445 

Thr Gly Gly Ala 

His Lys Phe Pro 
480 

Val Pro Leu Glu 
495 

Val Arg Lys Val 
510 

He Ser Ser Pro 
525 

Asn Pro Tyr Gin 
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<210> 285 
<211> 294 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (1) . . . (294) 

<223> Xaa = X or * as defined in Table 6 



<400> 285 



Pro 


Val 


Ala 


Thr 


Thr 


He 


Ser 


Gin 


1 








5 








Trp 

it 


Ser 


He 


Glv 

-* 


Val 


lie 


Thr 


Tvr 








20 










Phe 


Leu 


Glv 


Asp 


Thr 


Lys 


Gin 


Glu 






35 










40 


Ser 


Tvr 
y 


Asp 


Phe 


Asp 


Glu 


Glu 


Phe 




50 










55 




Gin 


Asp 


Phe 


He 


Arcr 


Lys 


Leu 


Leu 


65 










70 






Thr 


He 


Gin 


Glu 


Ala 


Leu 


Arcr 


His 










85 








Glv 


Arg 


Ala 


Pro 


Glu 


Gin 




Lys 








100 










Lys 


His 


Leu 


Arc 


Glu 


Tvr 
y 


Thr 


Leu 






115 










120 


Asn 


Asn 


Cys 


Tvr* 


Val 


Asn 


Phe 


Glu 




130 










13 5 




Val 


Ala 


Arg 


Val 


Asp 


Leu 


Gly 


Cys 


145 










150 






Thr 


He 


Gin 


Asp 


Asp 


Val 


Glu 


Ala 










165 








Glu 


Ala 


Trp 


Tyr 


Arg 


Asp 


Glu 


Asn 








180 










Gin 


Leu 


Arg 


Tyr 


Glu 


Phe 


Arg 


Lys 






195 










200 


Arg 


Glu 


Asp 


He 


Gin 


Ala 


Thr 


Gly 




210 










215 




Lys 


Leu 


Asp 


His 


Leu 


Gin 


Ala 


Gin 


225 










230 






Ser 


Ala 


Asp 


Leu 


Gin 


Trp 


He 


Gin 










245 








Glu 


Ser 


Gly 


Ser 


Ser 


Glu 


Gly 


Leu 








260 










Ser 


Glu 


Ser 


Leu 


Ser 


Glu 


Leu 


Leu 






275 










280 


Leu 


Ala 


Gly 


Trp 


Lys 


Leu 







290 



Pro 


Leu 
10 


Ser 


Leu 


Glu 


Ala 


Asp Met 
15 


He 


Leu 


Leu 


Ser Gly Ala 


Ser Pro 


25 










30 




Thr 


Leu 


Ala 


Asn 


He 
45 


Thr 


Ala Val 


Phe 


Ser 


Glu 


Thr 
60 


Ser 


Glu 


■ucu Ala 


Gly 


•**ctc* 




Thr Arg Lys 








75 








80 


Pro 


up 


IXC 


Thr 


Ser Lys 






90 










95 


Thr 


Glu 


Pro 


Thr 


Gin 


Leu 




105 










110 




Lys 


Cys 


His 


Ser 


Ser 
125 


Met 


Pro Pro 


Arg 


tr II ti 


Aid 


Cys 
140 


Val 


Val 


«JLU ASp 


Arg 


Ala 


Leu 
155 


Val 


Glu 


Ala 


His Asp 
160 


Leu 


Val 
170 


Ser 


He 


Phe 


Asn 


Glu Lys 
175 


Glu 


Ser 


Ala 


Arg 


His 


Asp 


Leu Ser 


185 










190 




Val 


Glu 


Ser 


Leu 


Lys 
205 


Lys 


Leu Leu 


Cys 


Ser 


Leu 


Gly Ser Met 


Ala Arg 








220 








Phe 


Glu 


He 


Leu Arg Gin 


Glu Leu 






235 








240 


Glu 


Leu 


Val 


Gly Ser Phe 


Gin Leu 




250 










255 


Gly 


Ser 


Thr 


Phe 


Tyr 


Gin 


Asp Thr 


265 










270 




Ser 


Arg 


Ser 


Cys 


Thr 
285 


Glu 


Glu Phe 



<210> 286 
<211> 51 
<212> PRT 

<213> Homo sapiens 



<400> 286 

Met Val Pro Val Phe Ser Val Glu Lys Asp Gly Glu Glu Leu Gly Ser 

15 10 15 

Phe Arg Pro Arg Trp Ala Asp Trp Leu Thr Gly Leu Leu Glu Trp Val 
20 25 30 
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Ser Val Glu Ser Leu Ser lie Tyr Cys lie Ser Gin Pro Val Tyr Met 

35 40 45 

Trp Val Glu 
50 



<210> 287 

<211> 6 

<212> PRT 

<213> Homo sapiens 



<400> 287 
Met Trp His Leu Ser Val 
1 5 



<210> 288 
<211> 116 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (1) . . . (116) 

<223> Xaa = X or * as defined in Table 6 



<400> 288 



His 


Pro 


His 


Ser 


Pro 


Asp Pro 


Gly 


Ser 


Ala 


Leu Gly Ser Ser Ser Gly 


1 








5 








10 


15 


Gly 


Trp 


Leu 


Pro 


Ala 


Pro Leu 


Ser 


Pro 


Cys 


Arg Gly Xaa Ala Gly Ala 








20 








25 




30 


Gly 


Gly 


Gly 


Arg 


Arg 


Cys Arg 


Gly 


Arg 


Pro 


Trp Ser Arg Ala Gly Xaa 






35 








40 






45 


Ala 


Cys 


Ser 


Gly 


His 


Ala Gly 


Ser 


Arg 


Cys 


Cys Pro Ala Xaa Ser Val 




50 








55 








60 


Cys 


Gly 


Gly 


Leu 


Pro 


Gly Gly 


Ala 


Pro 


Gly 


Cys Leu Cys Lys Gly Gly 


65 










70 








75 80 


Ser 


Ala 


Gly 


Phe 


Cys 


Cys Gin 


Gly 


Pro 


Gly 


Cys Ser Cys Ser Gly Cys 










85 








90 


95 


Ser 


Gly 


Ser 


Gly 


His 


Gly Gly 


Tyr 


Arg 


His 


Arg Gin Gly Arg Pro Leu 








100 








105 




110 


Ser 


Ala 


Ser 


Gin 


















115 

















<210> 289 
<211> 1654 
<212> PRT 

<213> Homo sapiens 
<220> 

<22l> mis cofeature 
<222> (1) . . . (1654) 

<223> Xaa = X or * as defined in Table 6 



<400> 289 

Ser Val Tyr Lys Ala Asp Leu Glu Trp Leu Arg Gly He Gly Trp Met 
1 5 10 15 
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Pro 


Glu 


Gly 


Ser 








20 


Val 


Asn 


Glu 


Arg 






35 




Ser 


lie 


Val 


Asp 




50 






Gin 


He 


Ser 


Glu 


65 








Asn 


He 


Thr 


He 


Asn 


Ala 


Leu 


Gin 








100 


Thr 


Lys 


bill 


Lys 






115 




His 


Ala 


Lys 


Ala 




130 






Gly 


Tyr 


Arg 


Lys 


145 








Asp 


Asp 


Pro 


Lys 


Aap 


Arg 


Glu 


Tyr 








180 


Thr 


Pro 


Leu 


Asp 






195 




Leu 


Val 


Ser 


Asp 




210 






Leu 


Pro 


Asp 


Gin 


225 








Gin 


Ser 


Asp 


Pro 


Val 


Asn 


Val 


Pro 








260 


Ala 


Leu 


Gin 


He 






275 




Lys 


Met 


Thr 


Gly 




290 






Ala 


Lys 


Ala 


Ser 


305 








Tyr 


Glu 


Lys 


Gin 


Asp 


Pro 


Lys 


Leu 








340 


Arg 


Leu 


Tyr 


Lys 






355 




Pro 


Val 


Asp 


Met 




370 






Ala 


Ser 


Asp 


Val 


385 








Pro 


Asp 


Gin 


Asn 


Ser 


Asp 


Thr 


Glu 








420 


Ala 


Asp 


Met 


Thr 






435 




Thr 


Leu 


Glu 


Asp 




450 






Leu 


Thr 


Leu 


Thr 


465 








Thr 


Glu 


lie 


Glu 


Ser 


Ser 


He 


He 








500 


His 


Ser 


Ala 


Val 






515 





Val 


Glu 


Met 


Asn 


Leu 


Tyr 


Arg 


Thr 








40 


Thr 


Pro 


Glu 


Val 






55 




Lys 


Leu 


Tyr 


Gin 




70 






Pro 


Ser 


Asp 


Thr 


85 








He 


Ser 


Asn 


Lys 


Gly 


Tyr 


Asp 


He 








120 


Ser 


Arg 


Glu 


lie 






135 




Gin 


Leu 


Gly 


His 




150 






Ser 


Val 


Trp 


Ala 


165 








Lys 


Lys 


Ala 


Tyr 


Met 


Met 


Ser 


He 








200 


lie 


Asp 


Tyr 


Arg 






215 




Asn 


Asp 


Val 


He 




230 






Leu 


Tyr 


Arg 


Asn 


245 








Ala 


Asp 


Thr 


Pro 


Ser 


Asn 


Lys 


Arg 








280 


Tyr 


Asp 


Leu 


Arg 






295 




Arg 


Asp 


He 


Ala 




310 






Lys 


Gly 


His 


Tyr 


325 








Val 


Trp 


Ala 


Ala 


Lys 


Ala 


Tyr 


Asn 








360 


Val 


Ser 


He 


Ser 






375 




Asp 


Tyr 


Arg 


His 




390 






Asp 


Val 


He 


Gin 


405 








Pro 


Cys 


Ser 


Leu 


Ala 


Arg 


Gly 


Gin 








440 


Pro 


Ser 


Ala 


Ser 






455 




Ser 


Arg 


Met 


Thr 




470 






Lys 


Lys 


Leu 


Pro 


485 








Asp 


Ser 


Leu 


Glu 


Val 


Asp 


Phe 


Glu 



520 



Arg 


Val 


Lys 


Val 


25 








Arg 


Pro 


Glu 


Ala 


Val 


Leu 


Ala 


Lys 








60 


Glu 


Ala 


Trp 


Asn 






75 




Pro 


Glu 


Met 


Leu 




90 






Leu 


Tyr 


Gin 


Lys 


105 








Arg 


Ala 


Asp 


Ala 


Ala 


Ser 


Glu 


Tyr 








140 


His 


Met 


Gly 


Phe 






155 




He 


His 


Ala 


Ala 




170 






Glu 


Lys 


Ser 


Lys 


185 








Val 


Gin 


Ala 


Lys 


Asn 


Tyr 


Leu 


His 








220 


Gin 


Ala 


Lys 


Lys 






235 




Ala 


Trp 


Glu 


Lys 




250 






Leu 


Met 


Leu 


Gin 


265 








Tyr 


Gin 


Gin 


Ala 


Ala 


Asp Ala 


He 








300 


Ser 


Asp 


Tyr 


Leu 






315 




He 


Gly Cys 


Arg 




330 






Asn 


Val 


Leu 


Lys 


345 








Asp 


His 


Lys 


Ala 


Ala 


Ala Lys 


Glu 








380 


Tyr 


Leu 


His 


His 






3 95 




Ala 


Arg Lys 


Ala 




410 






Ala 


Gin 


Ala 


Gly 


425 








Ser 


Pro 


Leu 


Ala 


His 


Gly Gly 


Gin 








460 


Gly 


Glu 


Glu 


Gly 






475 




Arg 


Leu Tyr 


Lys 




490 






He 


Leu 


Phe 


Asn 


505 








Ala 


Leu 


Asn 


Val 



Ala 


Gin 


Asp 


Leu 




30 






Leu 


Ser 


Phe 


Thr 


45 








Ala 


Asn 


Ser 


Leu 


Lys 


Asp 


Lys 


Ser 








80 


Gin 


Ala 


His 


lie 






95 




Asp 


Trp 


Asn 


Asp 




110 






He 


Glu 


He 


Lys 


125 








Lys 


Tyr 


Lys 


Glu 


Arg 


Thr 


Leu 


Gin 








160 


Lys 


He 


Gin 


Ser 






175 




Gly 


He 


His 


Asn 




190 






Lys 


Cys 


Gin 


Val 


205 








Gin 


Trp 


Thr 


Cys 


Ala 


Tyr 


Asp 


Leu 








240 


Glu 


Lys 


Ala 


Asn 






255 




Ser 


Lys 


He 


Asn 




270 






Trp 


Glu 


Asp 


Val 


285 








Gly 


He 


Gin 


His 


Tyr 


Lys 


Thr 


Ala 








320 


Ser 


Ala 


Lys 


Glu 






335 




Met 


Gin 


Asn 


Asp 




350 






Lys 


He 


Ser 


He 


365 








Gly 


Gin 


Ala 


Leu 


Trp 


Ser 


Cys 


Phe 








400 


Tyr 


Asp 


Leu 


Gin 






415 




Val 


Gin 


Trp 


Val 




430 






Pro 


Leu 


Leu 


Glu 


445 








Thr 


Asp 


Ala 


Tyr 


Lys 


Glu 


Val 


He 








480 


Val 


Leu 


Lys 


Val 






495 




Lys 


Gly 


Glu 


Thr 




510 






He 


Val 


Arg 


Leu 



S25 
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He 


Glu 


Gin 


Ala 


Pro 


lie 


Gin 


Met 


Gly 


Glu 


Glu 


Ala 


Val 


Arq 


Trp 


Ala 




530 










535 










540 










Lys 


Leu 


Val 


He 


Pro 


Leu 


Val 


Val 


His 


Ser 


Ala 


Gin 


Lys 


Val 


His 


Leu 


545 










550 










555 










560 


Arg 


Gly 


Ala 


Thr 


Ala 


Leu 


Glu 


Met 


Gly 


Met 


Pro 


Leu 


Leu 


Leu 


Gin 


Lys 










565 










570 










575 




Gin 


Gin 


Glu 


He 


Ala 


Ser 


He 


Thr 


Glu 


Gin 


Leu 


Met 


Thr 


Thr 


Thr 


Leu 








580 










585 










590 






His 


Arg 


Ser 


Gly 


Ser 


Phe 


He 


Asn 


Ser 


Leu 


Leu 


Gin 


Leu 


Glu 


Glu 


Leu 






595 










600 










605 








Gly 


Phe 


Arg 


Ser 


Gly 


Ala 


Pro 


Met 


He 


Lys 


Lys 


He 


Ala 


Phe 


He 


Ala 




610 










615 










620 










Trp 


Lys 


Ser 


Leu 


He 


Asp 


Asn 


Phe 


Ala 


Leu 


Asn 


Pro 


Asp 


He 


Leu 


Cys 


625 










630 










635 










640 


Ser 


Ala 


Lys 


Arcr 


Leu 


Lys 


Leu 


Leu 


Met 


Gin 


Pro 


Leu 


Ser 


Ser 


He 


His 










645 










650 










655 




Val 


Arg 


Thr 


Glu 


Thr 


Leu 


Ala 


Leu 


Thr 


Lys 


Leu 


Glu 


Val 


Trp 


Trp 


Tyr 








660 










665 










670 






Leu 


Leu 


Met 


Ara 


Leu 


Gly 


Pro 


His 


Leu 


Pro 


Ala 


Asn 


Phe 


Glu 


Gin 


Val 






675 










680 










685 








Cys 


Val 


Pro 


Leu 


He 


Gin 


Ser 


Thr 


He 


Ser 


He 


Asp 


Ser 


Asn 


Ala 


Ser 




690 










695 










700 










Pro 


Gin 


Gly 


Asn 


Ser 


Cvs 


His 


Val 


Ala 


Thr 


Ser 


Pro 


Gly 


Leu 


Asn 


Pro 


705 










710 










715 










720 


Met 


Thr 


Pro 


Val 


His 


LVS 


Gly 


Ala 


Ser 


Ser 


Pro 


Tvr 


Glv 


Ala 


Pro 


Glv 










725 










730 










735 




Thr 


Pro 


Arg 


Met 


Asn 


Leu 


Ser 


Ser 


Asn 


Leu 


Glv 


Glv 


Met 


Ala 


Thr 


He 








740 










745 










750 






Pro 


Ser 


He 


Gin 


Leu 


Leu 


Glv 


Leu 


Glu 


Met 


Leu 


Leu 


His 


Phe 


Leu 


Leu 






755 










760 










765 








Gly 


Pro 


Glu 


Ala 


Leu 


Ser 


Phe 


Ala 


Lvs 


Gin 


Asn 


Lvs 


Leu 


Val 


Leu 


Ser 




770 










775 










780 










Leu 


Glu 


Pro 


Leu 


Glu 


His 


Pro 


Leu 


He 


Ser 


Ser 


Pro 


Ser 


Phe 


Phe 


Ser 


785 










790 










795 










800 


LVS 


His 


Ala 


Asn 


Thr 


Leu 


He 


Thr 


Ala 


Val 


His 


Asp 


Ser 


Phe 


Val 


Ala 










805 










810 










815 




Val 


Gly 


Lys 


Asp 


Ala 


Pro 


Gly 


Asn 


Lvs 


Lvs 


Glu 


Lvs 


Pro 


Glv 


Ser 


Glu 








820 










825 










830 






Val 


Leu 


Thr 


Leu 


Leu 


Leu 


Lys 


Ser 


Leu 


Glu 


Ser 


He 


Val 


Lvs 


Ser 


Glu 






835 










840 










845 








Val 


Phe 


Pro 


Val 


Ser 


Lys 


Thr 


Leu 


Gly 


Thr 


Pro 


Ala 


Leu 


Phe 


Leu 


He 




850 










855 










860 










Gin 


Leu 


He 


Phe 


Asn 


Asn 


Phe 


Leu 


Glu 


Cvs 


Gly 


Val 


Ser 


Asp 


Glu 


Arcr 


865 










870 










875 










880 


Phe 


Phe 


Leu 


Ser 


Leu 


Glu 


Ser 


Leu 


Val 


Gly 


Cys 


Val 


Leu 


Ser 


Gly 


Pro 










885 










890 










895 




Thr 


Ser 


Pro 


Leu 


Ala 


Phe 


Ser 


Asp 


Ser 


Val 


Leu 


Asn 


Val 


lie 


Asn 


Gin 








900 










905 










910 






Asn 


Ala 


Lys 


Gin 


Leu 


Glu 


Asn 


Lys 


Glu 


His 


Leu 


Trp 


Lys 


Met 


Trp 


Ser 






915 










920 










925 








Val 


He 


Val 


Thr 


Pro 


Leu 


Thr 


Glu 


Leu 


He 


Asn 


Gin 


Thr 


Asn 


Glu 


Val 




930 










935 










940 










Asn 


Gin 


Gly 


Asp 


Ala 


Leu 


Glu 


His 


Asn 


Phe 


Ser 


Ala 


He 


Tyr 


Gly 


Ala 


945 










950 










955 










960 


Leu 


Thr 


Leu 


Pro 


Val 


Asn 


His 


He 


Phe 


Ser 


Glu 


Gin 


Arg 


Phe 


Pro 


Val 










965 










970 










975 




Ala 


Thr 


Met 


Lys 


Thr 


Leu 


Leu 


Arg 


Thr 


Trp 


Ser 


Glu 


Leu 


Tyr 


Arg 


Ala 








980 










985 










990 






Phe 


Ala 


Arg 


Cys 


Ala 


Ala 


Leu 


Val 


Ala 


Thr 


Ala 


Glu 


Glu 


Asn 


Leu 


Cys 






995 








1000 








1005 








Cys 


Glu 


Glu 


Leu 


Ser 


Ser 


Lys 


He 


Met 


Ser 


Ser 


Leu 


Glu 


Asp 


Glu 


Gly 


1010 








1015 








1020 










Phe 


Ser 


Asn 


Leu 


Leu 


Phe Val Asp 


Arg 


He 


He 


Tyr 


He 


He 


Thr 


Val 



1025 1030 1035 1040 
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Met Val Asp Cys lie Asp Phe Ser Pro Tyr Asn He Lys Tyr Gin Pro 

1045 1050 1055 

Lys Val Lys Ser Pro Gin Arg Pro Ser Asp Trp Ser Lys Lys Lys Asn 

1060 1065 1070 

Glu Pro Leu Gly Lys Leu Thr Ser Leu Phe Lys Leu He Val Lys Val 

1075 1080 1085 

He Tyr Ser Phe His Thr Leu Ser Phe Lys Glu Ala His Ser Asp Thr 

1090 1095 1100 

Leu Phe Thr He Gly Asn Ser He Thr Gly He He Ser Ser Val Leu 
1105 1110 1115 1120 

Gly His He Ser Leu Pro Ser Met He Arg Lys He Phe Ala Thr Leu 

1125 1130 1135 

Thr Arg Pro Leu Ala Leu Phe Tyr Glu Asn Ser Lys Leu Asp Glu Val 

1140 1145 1150 

Pro Lys Val Tyr Ser Cys Leu Asn Asn Lys Leu Glu Lys Leu Leu Gly 

1155 1160 1165 

Glu He He Ala Cys Leu Gin Phe Ser Tyr Thr Gly Thr Tyr Asp Ser 

1170 1175 1180 

Glu Leu Leu Glu Gin Leu Ser Pro Leu Leu Cys He He Phe Leu His 
1185 1190 1195 1200 

Lys Asn Lys Gin He Arg Lys Gin Ser Ala Gin Phe Trp Asn Ala Thr 

1205 1210 1215 

Phe Ala Lys Val Met Met Leu Val Tyr Pro Glu Glu Leu Lys Pro Val 

1220 1225 1230 

Leu Thr Gin Ala Lys Gin Lys Phe Leu Leu Leu Leu Pro Gly Leu Glu 

1235 1240 1245 

Thr Val Glu Met Met Glu Glu Ser Ser Gly Pro Tyr Ser Asp Gly Leu 

1250 1255 1260 

Lys Leu Glu Ser Ser Ser Leu Lys Val Lys Gly Glu He Leu Leu Glu 
1265 1270 1275 1280 

Glu Glu Lys Ser Thr Asp Phe Val Phe He Pro Pro Glu Gly Lys Asp 

1285 1290 1295 

Ala Lys Glu Arg He Leu Thr Asp His Gin Lys Glu Val Leu Lys Thr 

1300 1305 1310 

Lys Arg Phe Glu Glu Gin Met Asp Ser Asp He Val He Pro Gin Asp 

1315 1320 1325 

Val Thr Glu Asp Cys Gly Met Ala Glu His Leu Glu Lys Ser Ser Leu 

1330 1335 1340 

Ser Asn Asn Glu Cys Gly Ser Leu Asp Lys Thr Ser Pro Glu Met Ser 
1345 1350 1355 1360 

Asn Ser Asn Asn Asp Glu Arg Lys Lys- Ala Leu He Ser Ser Arg Lys 

1365 1370 1375 

Thr Ser Thr Glu Cys Ala Ser Ser Thr Glu Asn Ser Phe Val Val Ser 

1380 1385 1390 

Ser Ser Ser Val Ser Asn Thr Thr Val Ala Gly Thr Pro Pro Tyr Pro 

1395 1400 1405 

Thr Ser Arg Arg Gin Thr Phe He Thr Leu Glu Lys Phe Asp Gly Ser 

1410 1415 1420 

Glu Asn Arg Pro Phe Ser Pro Ser Pro Leu Asn Asn He Ser Ser Thr 
1425 1430 1435 1440 

Val Thr Val Lys Asn Asn Gin Glu Thr Met He Lys Thr Asp Phe Leu 

1445 1450 1455 

Pro Lys Ala Lys Gin Arg Glu Gly Thr Phe Ser Lys Ser Asp Ser Glu 

1460 1465 1470 

Lys He Val Asn Gly Thr Lys Arg Ser Ser Arg Arg Ala Gly Lys Ala 

1475 1480 1485 

Glu Gin Thr Gly Asn Lys Arg Ser Lys Pro Leu Met Arg Ser Glu Pro 

1490 1495 1500 

Glu Lys Asn Thr Glu Glu Ser Val Glu Gly He Val Val Leu Glu Asn 
1505 1510 1515 1520 

Asn Pro Pro Gly Leu Leu Asn Gin Thr Glu Cys Val Ser Asp Asn Gin 

1525 1530 1535 

Val His Leu Ser Glu Ser Thr Met Glu His Asp Asn Thr Lys Leu Lys 
1540 1545 1550 
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Ala Ala Thr Val Glu Asn Ala Val Leu Leu Glu Thr Asn Thr Val Glu 

1555 1560 1565 

Glu Lys Asn Val Glu lie Asn Leu Glu Ser Lys Glu Asn Thr Pro Pro 

1570 1575 1580 

Val Val lie Ser Ala Asp Gin Met Val Asn Glu Asp Ser Gin Val Gin 
1585 1590 1595 1600 

lie Thr Pro Asn Gin Lys Thr Leu Arg Arg Ser Ser Arg Arg Arg Tyr 

1605 1610 1615 

Arg Ser Ser Arg Val Tyr Kis Xaa Lys Pro Arg Xaa Gly Lys Xaa Ser 

1620 1625 1630 

Ser Lys Lys Gly Thr Thr Xaa Gly Arg Arg Lys Thr Ser Ser Glu Glu 

1635 1640 1645 

Ser lie Ala Tyr Lys Arg 
1650 



<210> 290 
<211> 108 
<212> PRT 
<213> Homo sapiens 



<400> 290 



Lys 


Asp 


Ala 


Tyr Met Phe Lys Lys Gly 


Leu Leu Ala 


Leu Ala Leu Val 


1 






5 


10 


15 


Phe 


Ser 


Met 


Pro Val Phe Ala Ala Glu 


His Trp He 


Asp Val Arg Val 








20 25 




30 


Pro 


Glu 


Gin 


Tyr Gin Gin Glu His Val 


Gin Gly Ala 


He Asn He Pro 






35 


40 




45 


Leu 


Lys 


Glu 


Val Lys Glu Arg He Ala 


Thr Ala Val 


Pro Asp Lys Asn 




50 




55 


60 




Asp 


Thr 


Val 


Lys Val Tyr Cys Asn Ala 


Gly Arg Gin Ser Gly Gin Ala 


65 






70 


75 


80 


Lys 


Glu 


He 


Leu Ser Glu Met Gly Tyr 


Thr His Val 


Glu Asn Ala Gly 








85 


90 


95 


Gly 


Leu 


Lys 


Asp He Ala Met Pro Lys 


Val Lys Gly 





100 105 



<210> 291 
<211> 119 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (1)...(119) 

<223> Xaa = X or * as defined in Table 6 



<400> 291 



Ser 


Ser 


Pro 


Ser Cys 


His Leu 


Val Lys 


Lys 


He Lys lie Lys Met Lys 


1 






5 






10 


15 


Ser 


Pro 


Ala 


Leu Arg 


Gly Leu 


Ser Arg 


Gin 


His Thr Lys Ser Pro Val 








20 




25 




30 


Thr 


Phe 


Trp 


Trp Met 


Thr Phe 


Gly Asp 


Thr 


Ser Arg Pro Ser Gin Asp 






35 






40 




45 


Thr 


Leu 


Pro 


Met Asp 


Leu Gin 


Gin Leu 


Leu 


Gly Val Thr Lys Val Cys 




50 






55 






60 


Ser 


Lys 


Ala 


Thr Ser 


Pro Thr 


Ser Gin 


Arg 


Gly Gin Glu Val He Ser 


65 








70 






75 80 


Thr 


Pro 


Thr 


Ser Lys 


Ser Gly 


Pro Phe 


He 


Gly Arg Gly Ser Xaa Gly 



189 



WO 01/74836 



PCT/US01/10472 



85 90 95 

Xaa Ser Gly Arg Trp Glu Arg Pro Ser Cys Cys Leu His Phe Ser Tyr 

100 105 110 

Pro Gin Leu Arg Gly Leu Cys 
115 



<210> 292 
<211> 354 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (1) . . . (354) 

<223> Xaa = X or * as defined in Table 6 



<400> 292 



Arcr 


Glu 


Pro 


Ala Gly 


Ala 


Glv 


Ala 


1 






5 








Arcr 


Arcr 


Arcr 


Gly Asp 


Arcr 


Arcr 


Pro 








20 








Pro 


Ala 


Val 


Arg Ala 


Arg 


Ala 


Arcr 






35 








40 


Ala 


Ala 


Ala 


Gly Gly 


Gly 


Gly 


Ser 




50 








55 




Gly 


Glu 


Glu 


Tyr Gly 


He 


Met 


Ala 


65 








70 






Phe 


Ala 


Leu 


Leu Gly 


Thr 


His 


Gly 








85 








Phe 


Thr 


Thr 


Val Glu 


Asp 


Leu 


Gly 








100 








Leu 


Asn 


Asp 


Ser Ala 


Thr 


Glu 


Val 






115 








120 


Gly 


Val 


Val 


Leu Lys 


Glu 


Asp 


Ala 




130 








135 




Phe 


Lys 


Val 


Asp Ser 


Asp 


Asp 


His 


145 








150 






Phe 


Leu 


Pro 


Glu Pro 


His 


Gly 


His 








165 








.Gly 


Pro 


Pro 


Arg Val 


Glu 


Gly 


Leu 








180 








Thr 


Arg 


Gly 


Lys Thr 


Gly 


Leu 


Val 






195 








200 


Pro 


Pro 


Val 


Thr Asp 


Trp 


Ala 


Pro 




210 








215 




Pro 


Gin 


Gly 


Pro Ser 


Leu 


Asn 


Ala 


225 








230 






Pro 


Ser 


Ala 


Gly Pro 


Val 


Lys 


Ser 








245 








He 


Gly 


Pro 


Pro Pro 


Ala 


Arg 


Asn 








260 








Gly 


Leu 


Arg 


Pro Arg 


Pro 


Leu 


Gin 






275 








280 


Xaa 


Ala 


Ala 


Leu Trp 


Pro 


Phe 


Leu 




290 








295 




Leu 


Val 


Thr 


He He 


Phe 


He 


Tyr 


305 








310 






Val 


Leu 


Asp 


Asp Asp 


Asp 


Ala 


Gly 








325 








Gly 


Gin 


His 


Gin Asn 


Asp 


Lys 


Gly 



340 





Met Arg 


Ala 


Owe il a 


Arg 


val 




10 








15 






Arg 


Ser 


Pro 


Arg Pro 


Arg 


ASp 


25 








30 






Ser 


Ala 


Pro 


Pro 


Prn T ,Qi i 




He 










45 






Glv 


Trp 


Arg 


Leu 


Tyr Ala 




Ser 








60 








Phe 


Ala 


Leu 


Phe 


Val Leu 


Leu 


Glv 






75 








80 


Ala 


Ser 


Gly 


Ala 


Ala Gly 


Thr 


Val 




90 








95 




Ser 


Lys 


He 


Leu 


Leu Thr 


Cvs 


Ser 


105 








110 






Thr 


Gly His 


Arq 


Trp Leu 


Lvs 


Glv 










125 






Leu 


Pro Gly 


Gin 


Lys Thr 


Glu 


Ser 








140 








Val 


Gly Met 


Lys 


Tyr Ser 


Cys 


Val 






155 








160 


Gly 


Pro 


Thr 


He 


Gin Ala 


Ser 


Thr 




170 








175 




Xaa 


Ser 


Ser 


Phe 


Arg Thr 


His 


Ser 


185 








190 






Gly 


Ser 


Cys 


Lys 


Ser Glu 


Phe 


Val 










205 






Trp 


Tyr Lys 


He 


Thr Asp 


Ser 


Glu 








220 








Gin 


Arg 


Thr 


Arg 


Phe Phe 


Val 


Gly 






235 








240 


Tyr 


Gin His 


Xaa 


Glu Pro 


Glu 


His 




250 








255 




Arg 


Cys Asn 


Gly 


Thr Ser 


Ser 


Lys 


265 








270 






Phe 


Leu Arg 


Val 


Arg Thr 


Ala 


Thr 










285 






Gly 


He 


Val 


Gly 


Glu Val 


Leu 


Val 








300 








Glu 


Lys 


Arg 


Arg 


Lys Pro 


Glu 


Asp 






315 








320 


Ser 


Ala 


Pro 


Leu 


Lys Glu 


Ser 


Ala 




330 








335 




Lys 


Lys Arg 


Ser 


Ala Arg 


Gly 


Asn 


345 








350 
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Phe Ser 



<210> 293 
<211> 312 
<212> PRT 

<213:> Homo sapiens 



<400> 293 



Met 


Lys 


Val 


Leu 


Leu 


Glu 


Ser 


Val 


1 








5 








Leu 


Ala 


Ala 


Ala 


His 


Leu 


Arg 


Ser 








20 










Tyr 


Asp 


Ser 


Val 


Lys 


Lys 


Glu 


Lys 






35 










40 


Gin 


His 


Gin 


His 


Lys 


Gin 


Thr 


Arg 




50 










55 




Lys 


Leu 


Ser 


Leu 


Gly 


He 


Ala 


Phe 


65 










70 






Leu 


Asp 


Glu 


Pro 


Thr 


Ser 


Gly 


Val 










85 








Trp 


Asp 


lie 


Leu 


Leu 


Lys 


Tyr 


Arcr 








100 










Thr 


His 


His 


Leu 


Asp 


Glu 


Ala 


Glu 






115 










120 


Leu 


Gin 


His 


Gly 


Arg 


Leu 


Arg 


Cys 




130 










135 




Glu 


Ala 


Tyr 


Gly 


Gin Gly 


Leu 


Arg 


145 










150 






Val 


Leu 


Glu 


Ala 


His 


Asp 


Leu 


Lys 










165 








lie 


Lys 


He 


Tyr 


He 


Pro 


Gin 


Ala 








180 










Glu 


Leu 


Thr 


Tyr 


Thr 


He 


Pro 


Lys 






195 










200 


Gly 


Leu 


Phe 


Gin 


Ala 


Leu 


Asp 


Glu 




210 










215 




Gly 


Tyr 


Gly 


He 


Ser Asp 


Thr 


Thr 


225 










230 






Ala 


Ala 


Pro, 


Glu 


Pro 


Pro 


Met 


Leu 










245 








Arg 


Phe 


Ser 


Phe 


He 


Gin 


Val 


Val 








260 










Trp 


Val 


Gin 


Ala 


Gin Gly 


Ala 


Ser 






275 










280 


Glu 


Asp 


Leu 


Pro 


Ser 


Phe 


Pro 


Gin 




290 










295 




Lys 


Asp 


Pro 


His 


Gin 


Phe 


Ser 


Asn 


305 










310 







<210> 294 

<211> 581 

<212> PRT 

<213> Homo sapiens 



<400> 294 
Met Ala Ser His Ala Tyr Asp Lys 
1 5 



Lys 


Glu 


Arg 


Ala 


Glu 


Glu 


Glu Lys 




10 










15 


Phe 


Ala 


Ala 


Lys 


Lys 


Ala 


Lys Lys 


25 










30 




Thr 


Leu 


Gin 


Asp 


Val 


ASp 


Leu Thr 










45 






Ala 


Leu 


Ser 


Gly 


Gly 


Leu 


Lys Arg 








60 








Met 


Gly 


Met 


Ser 


Arg 


Thr 


Val Val 






75 








80 


Asp 


Pro 


Cys 


Ser 


Arcr 


His 


Ser Leu 




90 










95 


Glu 


Gly 


Arg 


Thr 


He 


He 


Phe Thr 


105 










110 




Ala 


Leu 


Ser 


Asp 


Arg 


Val 


Ala Val 










125 






Cys 


Gly 


Pro 


Pro 


Phe 


Cys 


Leu Lys 








140 








Leu 


Thr 


Leu 


Thr 


Arg 


Gin 


Pro Ser 






155 








160 


Asp 


Met 


Ala 


Cys 


Val 


Thr 


Ser Leu 




170 










175 


Phe 


Leu 


Lys 


Asp 


Ser 


Ser 


Gly Ser 


185 










190 




Asp 


Thr 


Asp 


Lys 


Ala 


Cys 


Leu Lys 










205 






Asn 


Leu 


His 


Gin 


Leu 


His 


Leu Thr 








220 








Leu 


Glu 


Glu 


Ala 


Glu 


Gly 


Arg Thr 






235 








240 


Glu 


Asp 


Gly 


His 


Ala 


Val 


Thr Gin 




250 










255 


Gly 


Cys 


Glu 


Asp 


Asp 


Arg 


Thr Thr 


265 










270 




Ala 


Pro 


Gly 


Gly 


Gin 


Arg 


Pro Gin 










285 






Asp 


Gly 


Arg 


Ser 


Arg 


Ala 


Gin Phe 








300 








Asn 


Gin 


Asn 


Ala 


Asn 


val 


Leu Val 



10 IS 
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His 


Leu 


Cys 


Phe 


Tyr 


Asn 


Arg 


He 


Pro 








20 










25 


Asp 


Ser 


Arg 


Ser 


Val 


Ser 


He 


Ser 


Tyr 






35 










40 




Met 


Gly 


Leu 


Glu 


Thr 


Ala 


Thr 


Ser 


Lys 




50 










55 






Ala 


Ser 


Thr 


Leu 


Leu 


Gly 


Met 


Leu 


Phe 


65 










70 








Leu 


Ser 


Leu 


Phe 


Ser 


Lys 


Lys 


Phe 


Gin 










85 










Ser 


Asp 


Asp 


Cys 


Leu 


Pro 


Pro 


Pro 


Pro 








100 










105 


Pro 


Ser 


Phe 


Leu 


Thr 


Trp 


Thr 


He 


Val 






115 










120 




Leu 


Leu 


Leu 


Ser 


Leu 


Pro 


Asn 


He 


Gin 




130 








135 






Gly 


Val 


Val 


Pro 


Ser 


Gin 


Pro 


Lys 


Ser 


145 










150 








Ser 


Gly 


Arg 


Val 


Val 


Lys 


Glu 


Lys 


Leu 










165 










Lys 


lie 


Lys 


Ser 


lie 


His 


He 


Leu 


Glu 








180 










185 


Glu 


Pro 


Ser 


Val 


Ser 


Arg 


Gin 


Leu 


Leu 






195 










200 




Ser 


Lys 


Glu 


Ala 


Asp 


Ser 


Trp 


Glu 


He 




210 










215 






Gin 


Thr 


Asn 


Val 


Gin 


Lys 


Pro 


Asp 


Lys 


225 










230 








Lys 


Arg 


Lys 


Trp 


Pro 


Leu 


Lys 


Gly 


Trp 










245 










Val 


His 


Gly 


Ser 


He 


Asp 


Val 


Gly 


Leu 








260 










265 


Lys 


Ala 


Arg 


Arg 


He 


Asp 


Leu 


Asp 


Thr 






275 










280 




Lys 


Val 


Lys 


Ser 


Val 


Phe 


Asn 


Ser 


Phe 




290 










295 






Asp 


Leu 


Pro 


Thr 


Pro 


Val 


Val 


Lys 


Ser 


305 










310 








Val 


Ser 


Lys 


Leu 


Arg 


His 


His 


Arg 


Leu 










325 










Arg 


Ser 


Pro 


Arg 


Asp 


Ala 


Ser 


Phe 


His 








340 










345 


Ala 


Glu 


Ser 


Ser 


Pro 


Ala 


Ala 


Asn 


Val 






355 










360 




Gin 


Pro 


Asn 


Ser 


Phe 


Pro 


Trp 


Gin 


Ser 




370 










375 






Leu 


Pro 


Ala 


Thr 


Cys 


Thr 


Thr 


Gly 


Gin 


385 










390 








Gin 


Asp 


Ser 


Glu 


Glu 


Met 


Asp 


Arg 


Cys 










405 










Gin 


Ser 


Asn 


Leu 


Val 


Glu 


Leu 


Ser 


Lys 








420 










425 


Leu 


Gin 


Arg 


Thr 


Gin 


Ser 


Ala 


Pro 


Asn 






435 










440 




Cys 


Val 


Asp 


He 


Ser 


Lys 


Lys 


Asp 


Lys 




450 










455 






Thr 


Lys 


Ser 


Val 


Ser 


Lys 


Asp 


Thr 


Lys 


465 










470 








Ser 


Ala 


Thr 


Met 


Ser 


Pro 


Val 


Arg 


Leu 










485 










Cys 


Ala 


Asp 


He 


Glu 


Phe 


Gin 


Thr 


Pro 








500 










505 


Leu 


Glu 


Ser 


Ser 


Thr 


Asp 


Tyr 


Thr 


Lys 






515 










520 
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Lys 


Thr 


Gly 


Ala 


Tyr 


Tyr 


Leu 










30 






Leu 


He 


Gly 


His 


His 


He 


Asp 








45 








Asn 


Glu 


Phe 


He 


Phe 


Asp 


Ser 






60 










Arg 


Lys 


Pro 


Ser 


Gin 


His 


Ser 




75 










80 


Glu 


Asn 


Leu 


He 


Tyr 


Leu 


Glu 


90 










95 




Pro 


Pro 


Pro 


Trp 


Ser 


Glu 


Pro 










110 






Thr 


Val 


Phe 


Gin 


Trp 


Val 


Ser 








125 








Val 


He 


Leu 


Tyr 


Arg 


Ala 


Val 






140 










Asp 


Asn 


Leu 


Lys 


Gly 


Trp 


Gly 




155 










160 


Arg 


Ser 


Glu 


He 


Pro 


Asp 


Trp 


170 










175 




Arg 


Thr 


Ala 


Ser 


Ser 


Ser 


Thr 










190 






Glu 


Pro 


Glu 


Pro 


Val 


Pro 


Leu 








205 








He 


Glu 


Gly 


Leu 


Lvs 


He 


Glv 






220 










His 


Glu 


Gly 


Phe 


Met 


Leu 


Lys 




235 










240 


His 


Lvs 


He 


Gin 


Lvs 


Glv 


Lys 


250 










255 




Ser 


Val 


Met 


Ser 


He 


Lvs 


Lvs 










270 






Glu 


Glu 


His 


He 


Tyr 


His 


Leu 








285 








Ser 


Ala 


He 


He 


Atq 


Gly 


Asn 






300 










Gin 


Asp 


Trp 

XT 


Phe 


Asp 


Ala 


Trp 




315 










320 


Tyr 


Arq 


Gin 


Asn 


Glu 


He 


Val 


330 










335 




He 


Phe 


Pro 


Ser 


Thr 


Ser 


Thr 










350 






Ser 


Val 


Met 


Asp 


Gly 


Lys 


Met 








365 








Pro 


Leu 


Pro 


Cys 


Ser 


Asn 


Ser 






380 










Ser 


Lys 


Val 


Ala 


Ala 


Trp 


Leu 




395 










400 


Ala 


Glu 


Asp 


Leu 


Ala 


His 


Cys 


410 










415 




Leu 


Leu 


Gin 


Asn 


Leu 


Glu 


He 










430 






Phe 


Thr 


Asp 


Met 


Gin 


Ala 


Asn 








445 








Arg 


Val 


Thr 


Arg 


Arg 


Trp 


Arg 






460 










He 


Gin 


Leu 


Gin 


Val 


Pro 


Phe 




475 










480 


His 


Ser 


Ser 


Asn 


Pro 


Asn 


Leu 


490 










495 




Pro 


Ser 


His 


Leu 


Thr 


Asp 


Pro 










510 






Leu 


Gin 


Glu 


Glu 


Phe 


Cys 


Leu 



525 
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He Ala Gin Lys Gly Lys Gly Ala Ser Lys Lys Gin Ala Lys Arg Asn 

530 535 540 

Ala Ala Glu Lys Phe Leu Ala Lys Phe Ser Asn He Ser Pro Glu Asn 
545 550 555 560 

His He Ser Leu Val Ser Asn Val Asp Ser Tyr Asp Val Asn Val He 

565 570 575 

Lys His Phe Leu Gin 
580 



<210> 295 
<211> 416 
<212> PRT 

<213> Homo sapiens 



<400> 295 



Met 


Leu 


Arg 


Thr 


Arq 


Lys 


Ala 


Pro 


His 


Ser 


Trp 


Val 


Lys 


Ser 


Ser 


Ser 


1 








5 










10 










15 




Asn 


Thr 


Val 


His 


Tyr 


Arq 


Val 


Ser 


Val 


Val 


Cvs 


Leu 


His 


Asp 


His 


Val 








20 










25 










30 






Thr 


Asp 


Trp 


Glu 


Trp 

IT 


Gin 


Leu 


Thr 


Ala 


Thr 


Ala 


Ara 


His 


Pro 


Lys 








35 










40 










45 








Val 


Ser 


His 


Tyr 


lie 


Leu 


Tro 


AST> 


Gin 


Glu 


Lys 


Thr 


Lys 


He 


Lys 


He 




50 










55 










60 










Arg 


Lys 


Asp 


He 


He 


Arg 


He 


Leu 


Pro 


Ser 


Leu 


Asp 


Val 


Glu 


Val 


Lys 


65 










70 










75 










80 


Asp 


He 


Thr 


Asp 


Ser 


Tyr 


Asp 


Ala 


Asn 


Trp 


Phe 


Leu 


Gin 


Leu 


Leu 


Ser 










85 










90 










95 




Thr 


Glu 


Asp 


Leu 


Phe 


Glu 


Met 


Thr 


Ser 


Lys 


Glu 


Phe 


Pro 


He 


Val 


Thr 








100 










105 










110 






Glu 


Val 


He 


Glu 


Ala 


Pro 


GlU 


Gly 


Asn 


His 


Leu 


Pro 


Gin 


Ser 


He 


Leu 






115 










120 










125 








Gin 


Pro 


Gly 


Lys 


Thr 


He 


Val 


He 


His 


Lys 


Lys 


Tyr 


Gin 


Ala 


Ser 


Arg 




130 










135 










140 










He 


Leu 


Ala 


Ser 


Glu 


He 


Arg 


Ser 


Asn 


Phe 


Pro 


Lys 


Arg 


His 


Phe 


Leu 


145 










150 










155 










160 


He 


Pro 


Thr 


Ser 


Tyr 


Lys 


Gly 


Lys 


Phe 


Lys 


Arg 


Arg 


Pro 


Arg 


Glu 


Phe 










165 










170 










175 




Pro 


Thr 


Ala 


Tyr 


Asp 


Leu 


Glu 


He 


Ala 


Lys 


Ser 


Glu 


Lys 


Glu 


Pro 


Leu 








180 










185 










190 






His 


Val 


Val 


Ala 


Thr 


Lys 


Ala 


Phe 


His 


Ser 


Pro 


His 


Asp 


Lys 


Leu 


Ser 






195 










200 










205 








Ser 


Val 


Ser 


Val 


Gly 


Asp 


Gin 


Phe 


Leu 


Val 


His 


Gin 


Ser 


Glu 


Thr 


Thr 




210 










215 










220 










Glu 


Val 


Leu 


Cys 


Glu 


Gly 


He 


Lys 


Lys 


Val 


Val 


Asn 


Val 


Leu 


Ala 


Cys 


225 










230 










235 










240 


Glu 


Lys 


He 


Leu 


Lys 


Lys 


Ser 


Tyr 


Glu 


Ala 


Ala 


Leu 


Leu 


Pro 


Leu 


Tyr 










245 










250 










255 




Met 


Glu 


Gly 


Gly 


Phe 


Val 


Glu 


Val 


He 


His 


Asp 


Lys 


Lys 


Gin 


Tyr 


Pro 








260 










265 










270 






He 


Ser 


Glu 


Leu 


Cys 


Lys 


Gin 


Phe 


Arg 


Leu 


Pro 


Phe 


Asn 


Val 


Lys 


Val 






275 










280 










285 








Ser 


Val 


Arg 


Asp 


Leu 


Ser 


He 


Glu 


Glu 


Asp 


Val 


Leu 


Ala 


Ala 


Thr 


Pro 




290 










295 










300 










Gly 


Leu 


Gin 


Leu 


Glu 


Glu 


Asp 


He 


Thr 


Asp 


Ser 


Tyr 


Leu 


Leu 


He 


Ser 


305 










310 










315 










320 


Asp 


Phe 


Ala 


Asn 


Pro 


Thr 


Glu 


Cys 


Trp 


Glu 


He 


Pro 


Val 


Gly 


Arg 


Leu 










325 










330 










335 




Asn 


Met 


Thr 


Val 


Gin 


Leu 


Val 


Ser 


Asn 


Phe 


Ser 


Arg 


Asp 


Ala 


Glu 


Pro 








340 










345 










350 






Phe 


Leu 


Val 


Arg 


Thr 


Leu 


val 


Glu 


Glu 


He 


Thr 


Glu 


Glu 


Gin 


Tyr 


Tyr 






355 










360 










365 
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Met Met Arg Arg Tyr Glu Ser Ser 

370 375 
Pro Lys His Pro Ser Val Glu Glu 
385 390 
Ala Glu Glu Arg Thr Val Asp Leu 
405 



Ala Ser His Pro Pro Pro Arg Pro 
380 

Thr Lys Leu Thr Leu Leu Thr Leu 
395 400 
Pro Lys Ser Pro Lys Arg Arg Arg 
410 415 



<210> 296 
<211> 302 
<212> PRT 

<213> Homo sapiens 



<400> 296 



Met 


Phe 


Ala 


Phe 


Glu Pro Leu 


Glv Glv Cvs 


Arcr 


Pro 


Tro 


Arg 


Leu 


Ser 


1 








5 


10 










15 




Leu 


Pro 


Gly 


Leu 


Gly Ser Arg 


Leu Phe Arg 


Thr 


Tvr 


Glv 


Ala 


Ala 


Asp 








20 




25 








30 






Gly 


Arg 


Arcr 


Gin 


Arg Arg Pro 


Gly Arg Glu 


Ala 


Ala 


Gin 




Phe 


Pro 






35 






40 






45 








Pro 


Gin 


Asp 


Arc 


Arg Arg Phe 


Phe Asn Ser 


Ser 


Glv 


Ser 


Ser 


Asp 


Ala 




50 






55 






60 










Arg 


Met 


Gly 


Asp 


Pro Ser Gin 


Ser Asp Asp 


Pro 


Asp 


Asp 


Pro 


Asp 


Asp 


65 








70 




75 










80 


Pro 


Asp 


Phe 


Pro 


Gly Ser Pro 


Val Arg Arg 


Arcr 


Arcr 


Arc? 


Cvs 


Pro 


Glv 










85 


90 










95 




Gly 


Arg 


Val 


Pro 


Lys Asp Arg 


Pro Ser Leu 


Thr 


Val 


Thr 


Pro 


Lvs 


Arcr 








100 




105 








110 






Trp 


Lys 


Leu 


Arg 


Ala Arg Pro 


Ser Leu Thr 


Val 


Thr 


Pro 


Arg 


Arg 


Leu 






115 






120 






125 








Gly 


Leu 


Arg 


Ala 


Arg Pro Pro 


Gin Lys Cys 


Ser 


Thr 


Pro 


Cys 


Gly 


Pro 




130 






135 






140 










Leu 


Arg 


Leu 


Pro 


Pro Phe Pro 


Ser Arg Asp 


Ser 


Gly 


Arg 


Leu 


Ser 


Pro 


145 








150 




155 










160 


Asp 


Leu 


Ser 


Val 


Cys Gly Gin 


Pro Arg Asp 


Gly 


Asp 


Glu 


Leu 


Gly 


lie 










165 


170 










175 




Ser 


Ala 


Ser 


Leu 


Phe Ser Ser 


Leu Ala Ser 


Pro 


Cys 


Pro 


Gly 


Ser 


Pro 








180 




185 








190 






Thr 


Pro 


Arg 


Asp 


Ser Val He 


Ser He Gly 


Thr 


Ser 


Ala 


Cys 


Leu 


Val 






195 






200 






205 








Ala 


Ala 


Ser 


Ala 


Val Pro Ser 


Gly Leu His 


Leu 


Pro 


Glu 


Val 


Ser 


Leu 




210 






215 






220 










Asp 


Arg 


Ala 


Ser 


Leu Pro Cys 


Ser Gin Glu 


Glu 


Ala 


Thr 


Gly 


Gly 


Ala 


225 








230 




235 










240 


Lys 


Asp 


Thr 


Arg 


Met Gly Ser 


Val Arg Val 


Leu 


Arg 


Asp 


Pro 


Val 


Gly 










245 


250 










255 




Val 


Asn 


Leu 


Tyr 


Glu His Ser 


Val Ser Lys 


Cys 


His 


Val 


Gly 


Gin 


Pro 








260 




265 








270 






Asp 


Thr 


Asp 


Pro 


Arg Glu Lys 


Val Lys Ala 


Ala 


Pro 


Glu 


Glu 


Leu 


Cys 






275 






280 






285 








Leu 


His 


Ala 


Leu 


Gin His Pro 


Arg Ser Glu 


Gin 


Ala 


Asp 


Cys 








290 






295 






300 











<210> 297 

<211> 98 

<212> PRT 

<213> Homo sapiens 
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<400> 297 

Gin Gly Ala Phe Trp Leu Leu Phe Ser Ser Pro Arg Ser Phe Phe Leu 

1 5 10 15 

Leu Ser Val Pro Trp Trp Leu Pro Glu Ser Ser Arg Trp Leu Leu Leu 

20 25 30 

His Gly Lys Ser Gin Leu Ala Val Gin Asn Leu Gin Lys Val Ala His 

35 40 45 

Arg Gly Asp Trp Pro Gly Ser Gly His Pro Ala Pro Gin Ser Gin His 

50 55 60 

Ser Ser Leu Arg Arg Ser Ala Ala Arg ser Arg Pro Pro Cys Trp Ala 
65 70 75 80 

Arg Arg Trp Arg Ala Pro Pro His Thr Pro Arg Val Ala Gly Gly Ser 
85 90 95 

Gly Cys 



<210> 298 
<211> 175 
<212> PRT 

<213> Homo sapiens 



<400> 298 

Glu Ser Val Thr Phe Glu Asp Val Ala Val Glu Phe lie Gin Glu Trp 

15 10 15 

Ala Leu Leu Asp Ser Ala Arg Arg Ser Leu Cys Lys Tyr Arg Met Leu 

20 25 30 

Asp Gin Cys Arg Thr Leu Ala Ser Arg Gly Thr Pro Pro Cys Lys Pro 

35 40 45 

Ser Cys Val Ser Gin Leu Gly Gin Arg Ala Glu Pro Lys Ala Thr Glu 

50 55 60 

Arg Gly lie Leu Arg Ala Thr Cys Val Ala Trp Glu Ser Gin Leu Lys 
65 70 75 80 

Pro Glu Glu Leu Pro Ser Met Gin Asp Leu Leu Glu Glu Ala Ser Ser 

85 90 95 

Arg Asp Met Gin Met Gly Pro Gly Leu Phe Leu Arg Met Gin Leu Val 

100 105 110 

Pro Ser lie Glu Glu Arg Glu Thr Pro Leu Thr Arg Glu Asp Arg Pro 

115 120 125 

Ala Leu Gin Asp Pro Pro Trp Ser Leu Gly Cys Thr Gly Leu Lys Ala 

130 135 140 

Ala Met Gin lie Gin Arg Val Val He Pro Val Pro Thr Leu Gly His 
145 150 155 160 

Arg Asn Pro Trp Val Ala Arg Asp Ser Gly Ala He Gly Asn Gly 
165 170 175 



<210> 299 
<211> 197 
<212> PRT 

<213> Homo sapiens 



<400> 299 
Pro Leu Pro Leu Asp Gin Arg Leu 

1 5 
Ser Gly Gin Ser He He Arg Thr 
20 

Lys Ala Asp Gly Ala Leu Lys Gly 
35 "* 40 



Leu Ala Ser He Thr Pro Ser Pro 

10 15 
Gin Pro Gly Ala Gly Val His Pro 

25 30 
Glu Ala Glu Gin Ser Ala Gly His 
45 
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Pro 


Ser 
50 


Glu 


His 


Leu 


Phe 


He 
55 


Cys 


Val 


Pro 


Cys 


Thr 


Ala 


Ala 


Arg 


Pro 


65 










70 






Gin 


Arg 


Cys 


Leu 


Cys 
85 


Ser 


Ala 


Glu 


Leu 


Cys 


Cys 


Val 
100 


Lys 


Gly 


Leu 


Phe 


Asp 


Asn 


Cys 
115 


Ala 


Asp 


Glu 


Pro 


Cys 
120 


Val 


Arg 
130 


Trp 


Ala 


Ala 


Met 


Ser 
135 


Leu 


Cys 


Cys 


Tyr 


Leu 


Pro 


Thr 


Arg 


Gly 


145 










150 






Tyr 


Asp 


Ser 


Leu 


Arg 
165 


Arg 


Pro 


Gly 


Thr 


Val 


Cys 


Arg 
180 


Lys 


He 


Ser 


Ser 


Gin 


Glu 


Lys 


Ser 


Val 









195 



Glu 


Glu Cys 


Gly Arg Cys 


Lys 


Cys 






60 






Leu 


Pro Ser 


Cys Trp Leu 


Cys 


Asn 




75 






80 


Ser 


Leu Leu 


Asp Tyr Gly 


Thr 


Cys 




90 




95 




Tyr 


His Cys 


Ser Thr Asp 


Asp 


Glu 


105 




110 






Ser 


Cys Gly 


Pro Ser Ser 


Cys 


Phe 






125 






He 


Ser Leu 


Phe Leu Pro 


Cys 


Leu 






140 






Cys 


Leu His 


Leu Cys Gin 


Gin 


Gly 




155 






160 


Cys 


Arg Cys 


Lys Arg His 


Thr 


Asn 




170 




175 




Gly 


Ser Ala 


Pro Phe Pro 


Lys 


Ala 


185 




190 







<210> 300 
<211> 523 
<212> PRT 

<213> Homo sapiens 



<400> 300 



Met 


Lys 


Thr 


Cys 


1 








Gly 


Ser 


Ala 


Val 








20 


Ser 


Pro 


Leu 


Pro 






35 




Glu 


Tyr 


Phe 


Arg 




50 






Lys 


Arg 


Thr 


Ser 


65 








Phe 


Arg 


Ser 


Ala 


Arg 


Arg 


Val 


Pro 








100 


Leu 


Lys 


Lys 


Ala 






115 




Leu 


His 


Asn 


Cys 




130 






Val 


Gly 


Arg 


Ser 


145 








Ser 


Ser 


His 


His 


Tyr 


He 


Gly 


Gly 








18 0 


Thr 


Gly 


Leu 


Glu 






195 




Ala 


Val 


Asp 


Lys 




210 






Leu 


Leu 


Cys 


Arg 


225 








Gly 


Gly 


Cys 


Phe 


Val 


Leu 


Ser 


val 



260 



Leu 


Ala 


Phe 


Pro 


5 








Leu 


Arg 


Leu 


Gin 


Ser 


Val 


Glu 


Lys 








40 


Lys 


His 


Pro 


Pro 






55 




Pro 


Gly 


Pro 


Gly 




70 






Glu 


Ala 


Cys 


Gly 


85 








Val 


Pro 


Leu 


Ser 


Leu 


Arg 


Pro 


Glu 








120 


Glu 


Arg 


Asn 


Trp 






135 




Glu 


Ser 


Gin 


Thr 




150 






Arg 


Gly 


Thr 


Val 


165 








Trp 


Cys 


Leu 


Cys 


Gin 


Leu 


Leu 


Phe 








200 


Thr 


Trp 


Arg 


Thr 






215 




His 


cys 


He 


Thr 




230 






Leu 


Ser 


Pro 


Ser 


245 








He 


Val 


Ala 


Ser 



Leu Ala Phe His 
10 

Ala Thr His Gly 
25 

Ala Ser His Ser 

Val Arg Arg Ser 
60 

Ala Arg Val Pro 
75 

Val Ala Ala Leu 
90 

Ser Pro Ala Glu 
105 

Ser Arg Pro Phe 

Gly Asn Lys Gly 
140 

Gly Gin Ser Glu 
155 

Arg Glu Glu Leu 
170 

Gly Ser Ala Trp 
185 

Ser Arg Thr Arg 

Ala Arg Gin Leu 
220 

Leu Pro Trp Pro 
235 

Asn Met Leu Val 
250 

Pro Glu Pro Ser 
265 



His Asp 


Ser 


Asn 




15 




He Ser 


Ser 


Asp 


30 






Pro Asp 


Pro 


Ser 


45 






Gly Leu 


Arg 


Thr 


Gly Ser 


Gin 


Ser 






80 


Glu Cys 


Trp 


Arg 




95 




Val Gin 


Val 


Leu 


110 






Arg Asn 


Gin 


He 


125 






Trp Lys 


Gly 


Leu 


Lys Leu 


Ser 


Met 






160 


Val Val 


Glu 


Glu 




175 




Lys Leu 


Leu 


Val 


190 






Pro Gin 


Glu 


Glu 


205 






Glu Ser 


Gly 


Thr 


Ser Glu 


Arg 


Asn 






240 


Cys Glu 


Leu 


Arg 




255 




Thr Glu 


His 


Thr 


270 
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Gin 


Glu 


His 


Leu Ser Gly 


Asp Glu 






275 




280 


Lys 


Glu 


Lys 


Ser Leu Gly 


Leu Leu 




290 






295 


Pro 


Asn 


Tyr 


Pro Asn Pro 


Ala Val 


305 






310 




Val 


Ala 


Glu 


Glu Leu Asn 


Val Glu 








325 




Asn 


Val 


Leu 


Glu Ser Leu 


His Met 








340 




Tyr 


Thr 


Trp 


His Gly Arg 


His Asn 






355 




360 


Lys 


Ser 


lie 


Gly Glu Glu 


Asn Lys 




370 






375 


Lys 


Lys 


Lys 


Glu Tyr Glu 


Gin Glu 


385 






390 




Ser 


Val 


Asn 


Ser Arg Lys 


Asp Lys 








405 




Phe 


Val 


Met 


Leu Phe Leu 


Val Ser 








420 




Val 


Ala 


Ala 


Lys He Leu 


He Gly 






435 




440 


Lys 


Ser 


Lys 


Phe Lys Thr 


Gly Ser 




450 






455 


Leu 


Ser 


Gly 


Ala Gly Ser 


Lys Leu 


465 






470 




Phe 


Glu 


Val 


Ser Leu Ala 


Ala Phe 








485 




Phe 


Glu 


Pro 


Val Ala Leu 


Glu Glu 








500 




Arg 


Pro 


Trp 


Gly Gly Val 


Thr Leu 






515 




. 520 



Phe 


Glu 


Lys 


Ser 


Gin Pro 


Ser 


Arg 










285 






Cys 


His 


Lys 


Phe 


Leu Ala 


Arg 


Tyr 








300 








Asn 


Asn 


Asp 


He 


Cys Leu 


Asp 


Glu 






315 








320 


Arg 


Arg 


Arg 


He 


Tyr Asp 


He 


Val 




330 








335 




Val 


Ser 


Arg 


Leu 


Ala Lys 


Asn 


Arg 


345 








350 






Leu 


Asn 


Lys 


Thr 


Leu Gly 


Thr 


Leu 










365 






Tyr 


Ala 


Glu 


Gin 


lie Met 


Met 


He 








380 








Phe 


Asp 


Phe 


He 


Lys Ser 


Tyr 


Thr 






395 








400 


Ser 


Leu 


Arg 


Val 


Met Ser 


Gin 


Lys 




410 








415 




Thr 


Pro 


Gin 


He 


Val Ser 


Leu 


Glu 


425 








430 






Glu 


Asp 


His 


Val 


Glu Asp 


Leu 


Asp 










445 






Leu 


Val 


Arg 


Leu 


Phe Ala 


Pro 


Cvs 








460 








Pro 


Gly 


Leu 


Val 


Glu Ala 


Leu 


Ala 






475 








480 


Ser 


Val 


Val 


Ala 


Ser Val 


Glu 


Ser 




490 








495 




Trp 


Val 


Val 


His 


Thr Val 


Gly 


Leu 


505 








510 






Trp 


Trp 


Ala 











<210> 301 
<211> 81 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (1)...(81) 

<223> Xaa = X or * as defined in Table 6 



<400> 301 



Met 


Asp 


Leu 


He 


Val Tyr 


His Lys Lys 


Ser Asp 


He 


Ser 


Asn 


Gin 


Pro 


1 








5 




10 








15 




Ser 


He 


Pro 


Thr 


Cys Ala 


Leu Phe Phe 


Pro Cys 


Val 


Ser 


Leu 


Glu 


Pro 








20 




25 








30 






Phe 


Gin 


Leu 


Phe 


Pro Val 


Lys Gin Thr 


Ala Arg 


Arg 


Pro 


Pro 


Pro 


Tyr 






35 






40 






45 








Ser 


Ser 


Pro 


Gly 


Lys Ser 


Thr Gly Asn 


Val He 


Pro 


Phe 


Gly 


His 


Gly 




50 








55 




60 










Phe 


Pro 


Thr 


Leu 


Gin Pro 


Lys Xaa Gin 


He Thr 


Pro 


Val Gly 


Gly 


Gin 


65 








70 




75 










80 



<210> 302 
<211> 585 
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<212> PRT 

<213> Homo sapiens 
<220> 

<22X> misc_f eature 
<222> (1) . . . (585) 

<223> Xaa - X or * as defined in Table 6 



<400> 302 



Met 


Arg 


Gin 


Thr 


Lys 


Thr 


Glu 


Tyr 


lie 


Gin 


Glu 


Phe 


Asn 


Gin Glu Ala 


1 








5 










10 








15 


Thr 


Val 


Ala 


Arg 


Ala 


Leu 


Glu 


Gly 


Gin 


Glu 


Lys 


Pro 


Thr 


Glu Gly Pro 








20 










25 










30 


Arg 


Asn 


Thr 


Cys 


Leu 


Gly 


Ser 


Asn 


Asn 


Met 


Tyr 


Asp 


He 


Phe Asn Leu 






35 










40 










45 




Asn 


Asp 


Lys 


Ala 


Leu 


Cys 


Phe 


Thr 


Lys 


Cys 


Arg 


Gin 


Ser 


Gly Ser Asp 




50 










55 










60 






Ser 


Cys 


Asn 


Val 


Glu 


Asn 


Leu 


Gin 


Arg 


Tyr 


Trp 


Leu 


'Asn 


Tyr Glu Ala 


65 










70 










75 






80 


His 


Leu 


Met 


Lys 


Glu 


Gly 


Leu 


Thr 


Gin 


Lys 


Val 


Asn 


Thr 


Pro Phe Leu 










85 










90 








95 


Lys 


Ala 


Leu 


Val 


Gin 


Asn 


Leu 


Ser 


Thr 


Asn 


Thr 


Ala 


Glu 


Asp Phe Tyr 








100 










105 










110 


Phe 


Ser 


Leu 


Glu 


Pro 


Ser 


Gin 


Val 


Pro 


Arg 


Gin 


Val 


Met 


Lys Asp Glu 






115 










120 










125 




Asp 


Lys 


Pro 


Pro 


Asp 


Arg 


Val 


Arg 


Leu 


Pro 


Lys 


Ser 


Leu 


Phe Arg Ser 




130 










135 










140 






Leu 


Pro 


Gly 


Asn 


Arg 


Ser 


Val 


Val 


Arg 


Leu 


Ala 


Val 


Thr 


He Leu Asp 


145 










150 










155 






160 


lie 


Gly 


Pro 


Gly 


Thr 


Leu 


Phe 


Lys 


Gly 


Pro 


Arg 


Leu 


Gly 


Leu Gly Asp 










165 










170 








175 


Gly 


Ser 


Gly 


Val 


Leu 


Asn 


Asn 


Arg 


Leu 


Val 


Gly 


Leu 


Ser 


Val Gly Gin 








180 










185 










190 


Met 


His 


Val 


Thr 


Lys 


Leu 


Ala 


Glu 


Pro 


Leu 


Glu 


He 


Val 


Phe Ser His 






195 










200 










205 




Gin 


Arg 


Pro 


Pro 


Pro 


Asn 


Met 


Thr 


Leu 


Thr 


Cys 


Val 


Phe 


Trp Asp Val 




210 










215 










220 






Thr 


Lys 


Ala 


Leu 


Val 


Gly 


Gly 


Tyr 


Asp 


He 


Leu 


Ala 


He 


Tyr Glu Val 


225 










230 










235 






240 


Glu 


His 


Phe 


Gin 


Gin 


Glu 


Gin 


Cys 


Val 


Ala 


Val 


He 


Ser 


Val Cys Ser 










245 










250 








255 


Arg 


Gly 


Gly 


Lys 


Gly 


Ser 


Ala 


Glu 


Cys 


Gly 


His 


Trp 


Gly 


Lys Gly Leu 








260 










265 










270 


Thr 


Thr 


Glu 


His 


Ser 


Ser 


Pro 


Val 


Pro 


Ala 


Val 


Thr 


His 


Leu Ser Leu 






275 










280 










285 




Ala 


Arg 


lie 


Ala 


Glu 


Lys 


Gly 


Lys 


Ala 


Gly 


Cys 


Pro 


Ala 


Arg Ala Cys 




290 










295 










300 






Arg 


Val 


His 


Ser 


Trp 


Val 


Leu 


Val 


Leu 


Ser 


Gly 


Lys 


Arg 


Glu Val Pro 


305 










310 










315 






320 


Glu 


Asn 


Phe 


Phe 


lie 


Asp 


Pro 


Phe 


Thr 


Gly 


His 


Ser 


Tyr 


Ser Thr Gin 










325 










330 








335 


Asp 


Glu 


His 


Phe 


Leu 


Gly 


lie 


Glu 


Ser 


Leu 


Trp 


Asn 


His 


Lys Asn Tyr 








340 










345 










350 


Trp 


lie 


Asn 


Met 


Gin 


Asp 


Cys 


Trp 


Asn 


Cys 


Cys 


Lys 


Val 


Pro Arg Glu 






355 










360 










365 




Gly 


Glu 


Leu 


Gly 


Asp 


Pro 


Glu 


Glu 


Arg 


Leu 


Ala 


His 


Leu 


Ser Trp Trp 




370 










375 










380 






Leu 


Gly 


Leu 


Ser 


Val 


His 


Leu 


His 


Gly 


Gly 


Arg 


Ser 


Gin 


Ser Ala Ala 


385 










390 










395 






400 


Pro 


Glu 


Leu 


Pro 


Ser 


His 


His 


Pro 


Val 


Pro 


Ala 


Ala 


Leu 


Met Val Tyr 










405 










410 








415 


Glu 


Pro 


Leu 


Pro 


Ala 


Thr 


Thr 


Gly 


Leu 


Asp 


Leu 


Xaa 


Pro 


Gly Xaa Pro 








420 










425 










430 
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Cys 


Glu 


Met 


Gly Val His 


Ala Pro Gly 


Asp 


Xaa 


Xaa Val Ser 


Ala 


Val 






435 




440 






445 






Leu 


Asp 


Xaa 


Arg Arg Arg 


Gin Trp Asp 


Lys 


Arg 


Xaa Gly Xaa 


Cys 


Gly 




450 






455 






460 






Lys 


Ser 


Gly 


Gin Gly Gly 


Xaa Gly Xaa 


Glu 


Leu 


Arg His Ala 


Pro 


Leu 


465 






470 






475 






480 


Val 


Val 


Glu 


Gin He Glu 


He Ser Pro 


Glu 


Gly 


Thr Asn He 


Leu 


Glu 








485 




490 






495 




He 


Lys 


Glu 


Trp Tyr Gin 


Asn Arg Glu 


Asp 


Met 


Leu Glu Leu 


Lys 


His 








500 


505 






510 






He 


Asn 


Lys 


Thr Thr Asp 


Leu Lys Thr 


Asp 


Tyr 


Phe Lys Pro 


Gly 


His 






515 




520 






525 






Pro 


Gin 


Ala 


Leu Arg Val 


His Ser Tyr 


Lys 


Ser 


Met Gin Pro 


Glu 


Met 




530 






535 






540 






Asp 


Arg 


Val 


He Glu Phe 


Tyr Glu Thr 


Ala 


Arg 


Val Asp Gly 


Leu 


Met 


545 






550 






555 






560 


Lys 


Arg 


Glu 


Glu Thr Pro 


Arg Thr Met 


Thr 


Glu 


Tyr Tyr Gin 


Gly 


Arg 








565 




570 






575 




Pro 


Asp 


Phe 


Leu Ser Tyr 


Arg His Ala 













580 585 



<210> 303 
<211> 457 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (1) . . . (457) 

<223> Xaa = X or * as defined in Table 6 



<400> 303 



Gly 


Asp 


Gin 


Lys 


Val 


His 


Pro 


Phe 


Ser 


Thr 


Pro 


Ser Pro Gly 


Thr 


Pro 


1 








5 










10 






15 




Ala 


Phe 


His 


He 


Pro 


Thr 


Thr 


Phe 


Ser 


Pro 


Ala 


Ala Gly Pro 


Gly 


His 








20 










25 






30 






His 


Leu 


Pro 


Met Asp 


Pro 


Gly 


Glu 


Gly 


Leu 


Ala 


Glu Gly Pro 


Gly 


Leu 






35 










40 








45 






Pro 


Gly 


Ser 


Ser Gly 


Xaa 


Arg 


Pro 


Leu 


Xaa 


Val 


Pro Ser Arg 


Arg 


Ala 




50 










55 










60 






Ser 


His 


Cys 


Pro 


Pro 


Gly 


Ala 


Thr 


Lys 


Ala 


Arg 


Gly Gly Arg 


Cys 


Arg 


65 










70 










75 






80 


Gly 


Pro 


Ala 


Ala 


Thr 


Thr 


Gly 


Xaa 


Ala 


Ala 


Cys 


Ala Gly Arg 


Thr 


Ala 










85 










90 






95 




Ala 


Pro 


Gly 


Xaa 


Pro 


Gly 


Ala 


Ser 


Pro 


Pro 


Ala 


Ala Gin Ala 


Leu 


His 








100 










105 






110 






His 


Ser 


Leu 


Gin 


Glu 


Pro 


Gly 


Glu 


His 


Arg 


Gly 


Arg Pro Gly 


Pro 


Ser 






115 










120 








125 






Ala 


Ser 


Ala 


Pro 


Ser 


Ala 


Gly 


Thr 


Val 


Asp 


Gin 


Val Gly Gly 


Gly 


Ala 




130 










135 










140 






Glu 


Arg 


Met 


Pro 


Thr 


Thr 


Pro 


Gly 


Pro 


Arg 


His 


Ala Val Gly 


Glu 


Cys 


145 










150 










155 






160 


Gly 


Pro 


Thr 


Cys 


Ser 


Ala 


Ser 


Leu 


Arg 


Gly 


Pro 


Leu Xaa Pro 


Leu 


Pro 










165 










170 






175 




Asn 


Leu 


Ala 


Ala 


Pro 


Ala 


Gin 


Trp 


Gly 


Ser 


His 


Gin Leu Gin 


Gly 


Glu 








180 










185 






190 






Glu 


Gin 


He 


Gin 


Val 


Pro 


Ser 


Cys 


Cys 


Phe 


Ala 


Pro Gly He 


Gin 


Arg 






195 










200 








205 






Leu 


Leu 


Pro 


Arg 


Pro 


Gin 


Thr 


Gin 


Glu 


Pro 


Gly 


Phe Xaa Thr 


Gin 


Thr 




210 










215 










220 






Pro 


Asp 


Pro 


Gly Leu 


Lys 


Pro 


Gin 


Asp 


Ser 


Lys 


Pro Arg Leu 


Pro 


Gly 
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225 








230 


235 


240 


Leu 


Gin 


Thr 


Gin 


Thr Pro Asp Pro 


Gly Pro Arg Thr Arg Ala 


Asp Gly 










245 


250 


255 


Phe 


Pro 


Asp 


Gin 


Arg Gly Pro Val 


Gly Gin Gly Gin Trp Glu 


Gly Ala 








260 




265 270 




Pro 


Gly 


Gly 


His 


Thr Leu Gly Asn 


Ser Gly Gly Ser Cys Leu 


Ala Gly 






275 




280 


285 




Pro 


Pro 


Trp 


Xaa 


Arg Ser Glu Gly 


His Glu Glu Cys Pro Ser 


Ser Cys 




290 






295 


300 


Gin 


Ser 


Gin 


Phe 


Gly Glu Leu Arg 


Leu Trp Leu Pro Arg Gly 


Gly Trp 


305 








310 


315 


320 


Ala 


Glu 


Gly 


Val 


Ser Ala Gly Ser 


His Gly Pro Pro Trp Pro 


Ala Gly 










325 


330 


335 


Pro 


Ala 


Pro 


Pro 


Gly Pro Gin Pro 


Leu Gly Trp Asp Ala Gly 


Pro His 








340 




345 350 




Phe 


Pro 


Glu 


Glu 


Ser Arg Thr Arg 


Pro Gly Pro Asp Pro Glu 


Pro Xaa 






355 




360 


365 




I*ys 


Asp 


His 


Gly 


Thr Val Leu Xaa 


Leu Thr Gin Arg Lys His 


Arg Asp 




370 






375 


380 




Gly 


His 


Lys 


Glu 


Pro Arg Thr Lys 


He Gin Leu Pro Val Pro 


Gly Ala 


385 








390 


395 


400 


Glu 


Gly 


Gin 


Thr 


Cys Pro Pro Glu 


Pro Trp Ala Gly Ala His 


Ara Ara 










405 


410 


415 


Asn 


Ala 


Asn 


Trp 


Gin Ala Gin Gly 


Ser Arg Arg Glu Arg Pro 


Ser Gly 








420 




425 430 




Phe 


Gin 


Thr 


Pro 


Arg Ser His Trp 


Val Pro Ser Ala Gly Arg 


Ser Gly 






435 




440 


445 




Phe 


Leu 


Gly 


Pro 


Gin Phe Ser Cys 


Leu 






450 






455 







<210> 304 
<211> 42 
<212> PRT 

<213> Homo sapiens 



<400> 304 

Met Ser Gly Arg Val Phe Arg Cys Gin Ala Leu Val Ala Tyr Thr Val 

15 10 15 

Leu Ser Glu Leu Phe Thr Glu Ala Lys Glu Gin Arg Leu Ala Thr Asp 

20 25 30 

Glu Gly Gin Lys Glu Phe Ser Ala Glu Ser 
35 40 



<210> 305 
<211> 41 
<212> PRT 

<213> Homo sapiens 



<400> 305 

Met Ser Gly Arg Val Phe Arg Cys Gin Ala Leu Val Ala Tyr Thr Val 

15 10 15 

Leu Ser Glu Leu Phe Thr Glu Ala Lys Glu Gin Arg Leu Ala Thr Asp 

20 25 30 

Glu Gly Gin Lys Glu Phe Ser Ala Glu 
35 40 



200 



WO 01/74836 PCT/US01/10472 



<210> 306 
<211> 827 
<212> PRT 

<213> Homo sapiens 



<400> 306 



Gly 


Lys 


Tyr 


Tyr 


Lys 


Leu 


Ser 


Ser 


Gly 


Thr 


Ala 


Pro 


Thr 


Cys 


Val 


Ser 


1 








5 










10 










15 




Leu 


Gly 


Trp 


Gly 


Leu 


Ala 


Arg 


Gly 


Asp 


Ser 


Ala 


Ala 


Pro 


Ala 


Leu 


Gly 








20 










25 










30 






Ser 


Arg 


Thr 


Ser 


Ala 


Cys 


Ala 


Pro 


Cys 


Ser 


His 


Gly 


Thr 


Trp 


Lys 


Leu 






35 










40 










45 








Ser 


Leu 


Glu 


Pro 


Ser 


Asp 


Arg 


Leu 


Ser 


Pro 


Cys 


Asp 


Arg 


Ser 


Ser 


Glu 




50 










55 










60 










Glu 


Ala 


His 


Thr 


His 


Ala 


Pro 


His 


Arg 


Leu 


Leu 


Ala 


Leu 


Val 


Ala 


Ser 


65 










70 










75 










80 


Leu 


Pro 


Trp 


Ser 


Arg 


Leu 


Pro 


Leu 


Leu 


Ala 


Pro 


Gin 


Ser 


His 


Ser 


Glu 










85 










90 










95 




Ala 


Glu 


Ala 


Thr 


Ser 


Gin 


Pro 


Thr 


Gly 


Val 


Glu 


Asn 


His 


His 


Gin 


Lys 








100 










105 










110 






Thr 


Arg 


Tyr 


Val 


Lys 


Ala 


Gly 


Gly 


Pro 


Val 


He 


Cys 


Ara 


Ser 


Leu 


Pro 






115 










120 










125 








Glu 


Ser 


Arg 


Gly 


Phe 


Leu 


Trp 


Ala 


Ser 


Glu 


Gly 


Arg 


Lys 


Cys 


Met 


Leu 




130 










135 










140 










lie 


Gly 


Ser 


Trp 


Ala 


Ala 


Met 


Gly 


Arg 


Leu 


Arq 


Lys 


Ser 


Thr 


He 


Ser 


145 










150 










155 










160 


Ser 


Arg 


Phe 


Gly 


Pro 


Gin 


Thr 


Leu 


Ala 


Gly 


Thr 


Gly 


Ara 


Pro 


Gin 


Ala 










165 










170 










175 




He 


Pro 


Val 


Leu 


Lys 


Lys 


His 


Ser 


Asp 


Ala 


Val 


Leu 


Leu 


Gly 


Val 


Cvs 








180 










185 










190 






Phe 


Leu 


Lys 


Leu 


Leu 


His 


Gin 


His 


His 


Gin 


Glu 


Leu 


Gly 


Glu 


Asn 


Ala 






195 










200 










205 








Asp 


Ser 


Gin 


Thr 


Leu 


Pro 


Gin 


Thr 


His 


Trp 


Glu 


Phe 


He 


Leu 


Ser 


Glu 




210 










215 










220 










Asp 


Tyr 


Asn 


Lys 


Met 


Thr 


Pro 


Val 


Lys 


Asn 


Tyr 


Gin 


Val 


Leu 


Glu 


Val 


225 










230 










235 










240 


Leu 


Ala 


Arg 


Ala 


Met 


Arg 


Gin 


Glu 


Lys 


Gin 


He 


Lys 


Ser 


He 


Gin 


Leu 










245 










250 










255 




Gly 


Lys 


Glu 


Glu 


Val 


Lys 


Leu 


Ser 


Val 


Phe 


Ala 


Asp 


Asp 


Met 


He 


Val 








260 










265 










270 






Tyr 


Leu 


Glu 


Asn 


Pro 


He 


Val 


Ser 


Ala 


Gin 


Asn 


Leu 


Leu 


Lys 


Leu 


He 






275 










280 










285 








Ser 


Asn 


Phe 


Ser 


Lys 


Val 


Ser 


Gly 


Tyr 


Lys 


He 


Asn 


Val 


Gin 


Lys 


Ser 




290 










295 










300 










Gin 


Ala 


Phe 


Leu 


Tyr 


Thr 


Asn 


Asn 


Arg 


Gin 


Thr 


Glu 


Ser 


Gin 


He 


He 


305 










310 










315 










320 


Ser 


Glu 


Leu 


Pro 


Phe 


Thr 


He 


Pro 


Ser 


Lys 


Arg 


He 


Lys 


Tyr 


Leu 


Gly 










325 










330 










335 




He 


Gin 


Leu 


Thr 


Arg 


Asp 


Val 


Lys 


Asp 


Leu 


Phe 


Lys 


Glu 


Asn 


Tyr 


Lys 








340 










345 










350 






Pro 


Leu 


Leu 


Asn 


Glu 


He 


Lys 


Glu 


Asp 


Thr 


Asn 


Lys 


Trp 


Lys 


Asn 


He 






355 










360 










365 








Pro 


Cys 


Ser 


Trp 


Val 


Gly 


Arg 


He 


Asn 


He 


Met 


Lys 


Met 


Ala 


He 


Leu 




370 










375 










380 










Pro 


Arg 


Val 


He 


Tyr 


He 


Phe 


Asn 


Ala 


He 


Ser 


He 


Lys 


Leu 


Pro 


Met 


385 










390 










395 










400 


Thr 


Phe 


Phe 


Thr 


Glu 


Leu 


Glu 


Lys 


Thr 


Thr 


Leu 


Lys 


Phe 


He 


Trp 


Asn 










405 










410 










415 




Gin 


Lys 


Arg 


Ala 


Arg 


lie 


Ala 


Lys 


Thr 


He 


Leu 


Ser 


Gin 


Lys 


Asn 


Lys 








420 










425 










430 






Ala 


Gly 


Gly 


He 


Thr 


Leu 


Pro 


Asp 


Phe 


Lys 


Leu 


Tyr 


Tyr 


Lys 


Ala 


Thr 






435 










440 










445 








Val 


Thr 


Lys 


Thr 


Ala 


Trp 


Tyr 


Trp 


Tyr 


Gin 


Asn 


Arg 


Gly 


Val 


Asp 


Gin 
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450 










455 










460 






Trp 


Asn 


Arg 


He 


Glu 


Pro 


Ser 


Glu 


He 


He 


Pro 


His He His 


Asn 


His 


465 










470 










475 






480 


Leu 


He 


Phe 


Asp 


Lys 


Pro 


Asp Lys 


Asn 


Lys 


Lys 

2 


Trp Gly Lys 

fr 2 2 


Asp 

IT 


Ser 










485 










490 






495 




Leu 


Phe 


Thr 


Lys 


Trp 


Cys 


Trp Glu 


Asn 


Trp 


Leu 


Ala He Cys 


Arcr 


Lys 








500 










505 






510 






Leu 


Lys 


Leu 


Asp 


Pro 


Phe 


Leu Thr 


Pro 


Tyr 


Thr 


Lys He Asn 


Ser 


Thr 






515 










520 








525 






Trp 


He 


Lys 


Asp 


Leu 


Asn 


Val Arg 


Pro 


Lys 


Thr 


He Lys Thr 


Leu 


Glu 




530 










535 










540 






Glu 


Asn 


Leu 


Gly 


He 


Thr 


He 


Gin 


Asp 


He 


Gly 


Met Gly Lys 


Asp 


Phe 


545 










550 










555 






560 


Met 


Ser 


Lys 


Thr 


Pro 


Lys 


Ala 


Met 


Ala 


Thr 


Lys 

2 


Ala Lys He 


ASP 


Lvs 

2 a 










565 










570 






575 




Trp 


Asp 


Leu 


lie 


Lys 


Leu 


Lys 


Ser 


Phe 


Cys 


Thr 


Ala Lys Glu 


Thr 


Thr 








580 










585 






590 






He 


Arg 


Val 


Asn 


Arg 


Gin 


Pro 


Thr 


Glu 


Trp 


Glu 


Lys He Phe 


Thr 


He 






595 










600 








605 






Tyr 


Pro 


Ser 


Asp 


Lys 


Gly 


Leu 


He 


Pro 


Arcr 


He 


Tyr Lys Glu 


Leu 


Lys 




610 










615 










620 






Gin 


Asn 


Leu 


Gin 


Glu 


Lys 


He Lys 


Gin 


Pro 


His 


Gin Lys Val 


Glv 


Lys 


625 










630 










635 






640 


Gly 


Tyr 


Lys 


Gin 


Thr 


Phe 


Leu Lys 


Arcr 


Arcr 


His 


Leu Cvs Self 


Gin 


Gin 










645 










650 






655 




Thr 


His 


Glu 


Lys 


Met 


Phe 


He 


He 


Thr 


Gly 


His 


Gin Arg Asn 


Ala 


Lys 








660 










665 






670 






Gin 


Asn 


His 


Asn 


Lys 

2 


He 


His 


Leu 


Thr 


Pro 


Val 


Arcr Met Ala 


He 


He 






675 










680 








685 






Lys 


Lys 


Ser 


Gly 


Asn 


Asn 


Arg Asp 


Met 


ASD 


Glu 


Ala Gly Asn 


His 


His 




690 










695 










700 






Ser 


Glu 


Gin 


Thr 


He 


Ala 


Arg 


Thr 


Glu 


Asn 


Gin 


Ala Pro Tyr 


Leu 


Leu 


705 










710 










715 






720 


Thr 


His 


Arg 


Trp 


Glu 


Leu 


Asn 


Asn 


Glu 


Asn 


Thr 


Tro Thir Gin 


Val 


Glu 










725 










730 






735 




Glu 


His 


His 


Thr 


Leu 


Gly 


Pro 


He 


Val 


Gly 


Val 


He Cys Arg 


Lys 


Val 








740 










745 






750 






Phe 


Pro 


Gly 


Asn 


Ser 


Gly 


Pro 


Ser 


Lvs 


Pro 


Ser 


Glv Leu His 


Phe 


Ser 






755 










760 








765 






Gin 


Pro 


Leu 


Pro 


Gin 


Val 


Thr 


Ser 


Val 


Val 


Ala 


Lys He Thr 


He 


Val 




770 










775 










780 






Pro 


Trp 


Glu 


Met 


Lys 


Leu 


He 


Ala 


Met 


Gly 


Val 


Gin Asp Glu 


Leu 


Asn 


785 










790 










795 




800 


He 


Ala 


Phe 


His 


Lys 


Asn 


His 


Leu 


Leu 


Met 


Asn 


Asp Thr Thr 


He 


His 










805 










810 






815 




Met 


Thr 


Pro 


Tyr 


He 


Gin 


Pro Ala 


Pro 


Lys 


Ser 









820 825 



<210> 307 
<211> 135 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc__f eature 
<222> (1) . . . (135) 

<223> Xaa = X or * as defined in Table 6 



<400> 307 

Thr Pro Leu Pro Val Cys His Phe Thr Cys Arg Lys Asn His Leu Lys 
15 io 15 
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Gly 


Met 


Glu 


Asn 
20 


Leu Cys Leu 


His 


Lys 

• 25 


Lys 


Cys 


Met 


Trp 


Met 
30 


Ser 


Thr 


Val 


Ala 


Phe 
35 


Ser 


He He Ala 


Lys 
40 


Thr 


Trp 


Lys 


Gin 


Pro 
45 


Arg 


Cys 


Pro 


Ser 


Ala 
50 


Ala 


Pro 


Ser Trp Lys 
55 


Gin 


Pro 


Thr 


His 


Leu 
60 


Thr 


Thr 


Gly 


Asp 


Trp 


Ala 


Asn 


Gly 


Leu Gly Xaa 


Phe 


Ser 


Thr 


Arg 


Glu 


Tyr 


Val 


Thr 


Ala 


65 








70 








75 










80 


Xaa 


Glu 


Arg 


Thr 


Asn Gin ser 
85 


Lys 


Pro 


Asp 
90 


Thr 


Thr 


Thr 


Trp 


Val 
95 


Asn 


Leu 


Thr 


Asp 


Val 
100 


Gin Leu Ser 


Asn 


Ser 
105 


Ser 


Gin 


Ala 


Pro 


Arg 
110 


Gly 


Val 


Ser 


Thr 


Thr 
115 


Leu 


Gin Phe Pro 


Val 
120 


Leu 


Gly 


Thr 


Val 


Asp 
125 


Lys 


Ser 


Gly 


Val 


Thr 


Met 


Thr 


Phe Trp Val 





















130 135 



<210> 308 
<211> 313 
<212> PRT 

<213> Homo sapiens 



<400> 308 



Met 


Gly 


Asn 


Lys 


1 








Phe 


Thr 


Leu 


His 








20 


Phe 


Glu 


Ser 


Ser 






35 




Gin 


Thr 


Ser 


Pro 




50 






Arg 


Leu 


Thr 


Ala 


65 








Ser 


Ala 


He 


His 


Ser 


Asp 


Leu 


Ala 








100 


Arg 


Arg 


Val 


Lys 






115 




Gin 


Ser 


Asn 


Gly 




130 






Ser 


Leu 


Glu 


Lys 


145 








His 


Ser 


Pro 


Ser 


Ser 


Ala 


Glu 


His 








180 


Leu 


Gin 


Pro 


Gin 






195 




Tyr 


Thr 


Arg 


Ser 




210 






Gly 


Leu 


Leu 


Gly 


225 








Tyr 


Pro 


Gly 


Phe 


Gly 


Leu 


Glu 


Asn 








260 


Pro 


Met 


Gly 


Glu 






275 




Leu 


Ala 


Ser 


Pro 




290 







Thr 


Tyr Gly 


Gly 


5 








Ser 


Leu 


Phe 


Leu 


Ser 


Gin Lys 


Pro 








40 


Ser 


Leu Arg 


Lys 






55 




Ser 


Lys 


Ser 


Val 




70 






Val 


Asn Gly 


Asn 


85 








Tyr 


Leu Arg 


Leu 


Val 


Ser His 


Ala 








120 


Pro 


Val 


Ser 


He 






135 




Ala 


Thr 


Ala 


Ala 




150 






Trp 


Glu 


Ala 


Arg 


165 








Asn 


Ser 


Leu 


Ser 


Leu 


Leu 


Pro 


Thr 








200 


Pro 


Gin 


Ser 


Leu 






215 




Thr 


Met 


Leu 


Arg 




230 






Leu 


Pro Trp 


Cys 


245 








Glu 


Cys 


Lys 


Val 


Pro 


Glu Gly 


Arg 








280 


Gly 


Ser 


Pro 


Leu 






295 





Gin Asn Gin Met 
10 

Asn Ser Gly Asp 
25 

Gly Gly Phe Arg 

His Phe Pro Val 
60 

Glu Glu Met Pro 
75 

Leu Ser Glu Gin 
90 

Ala Gin His Leu 
105 

Phe Leu Pro Arg 

Cys Leu Glu Ala 
140 

He Gin Val Pro 
155 

Glu Ser Ala Leu 
170 

Arg Pro Ser Asp 
185 

Leu Cys Leu Pro 

Gly His Gly Pro 
220 

Ser Arg Arg Thr 
235 

Ser Gly Arg He 
250 

Ser Leu Ser Gly 
265 

Trp Ser Ser Pro 

He Ala Trp Ala 
300 



Leu 


He 


Phe 


Ala 






15 




Gly 


Arg 


Leu 


Ser 




30 






Asn 


He 


Ala 


He 


45 








Phe 


Lys 


Arg 


Lys 


Thr 


Ala 


Ser 


Gin 








80 


Asp 


He 


Val 


Ser 






95 




Glu 


Asp 


Gly 


Pro 




110 






Val 


Pro 


Lys 


Val 


125 








Gly 


Thr 


Trp 


Arg 


Asp 


Asp 


He 


Tyr 








160 


Ser 


Pro 


Asp 


Arg 






175 




Pro 


Gly 


Leu 


Ser 




190 






Phe 


His 


Val 


Leu 


205 








He 


Ala 


Val 


His 


Trp 


Ser 


Phe 


Leu 








240 


Gly 


Ser 


Arg 


Val 






255 




Ser 


Ser 


Ser 


Gin 




270 






Glu 


Val 


Gly 


Pro 


285 








Lys 


Leu 


Arg 


Phe 
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Val Pro Pro Val Asp Asp 
305 310 



<210> 309 
<211> 509 
<212> PRT 

<213> Homo sapiens 



<400> 309 



Met 


Asp 


Ser 


Gin 


Glu 


Val 


1 








5 




Glu 


Glu 


He 


Pro 


Phe 


Ser 








20 






Ala 


Leu 


Glu 


His 


Glv 


Lys 






35 








Leu 


Phe 


Cys 


Phe 


Ser 


Ser 




50 










Lys 


Val 


Phe 


Leu 


Lys 


Asn 


65 










70 


Phe 


His 


Leu 


Leu 


Thr 


Met 










85 




Val 


Glu 


Pro 


Val 


Ser 


Pro 








100 






Leu 


Glu 


Glu 


Leu 


Tr"D 

■ LJ -Jr' 


Thr 






115 








Gin 




Asp 




Glu 


Lys 




13 0 










His 


Gin 


Glv 


Val 


Glv 


Lys 


145 










150 


Lys 


Ala 


Glv 


He 


Val 


Val 










165 




Phe 


Met 


Gly 


He 


Leu 


Ser 








180 






Glu 


Lys 


lie 


He 


Leu 


Arg 






195 








Phe 


Gly 


Met 


Val 


He 


Phe 




210 










Ser 


Gly 


Lys 


Thr 


Lys 


Phe 


225 










230 


Thr 


Leu 


Val 


Leu 


Trp 


He 










245 




He 


Met 


Arg 


Gly 


His 


Ser 








260 






Tyr 


Ser 


Arg 


Lys 


Ala 


He 






275 








Glu 


Glu 


Leu 


Gly 


Gin 


He 




290 










Leu 


Thr 


Gin 


Asn 


lie 


Met 


305 










310 


He 


Tyr 


Gly 


Glu 


Val 


His 










325 




Gin 


Leu 


He 


His 


Arg 


Trp 








340 






Lys 


Asn 


Gin 


Thr 


Asp 


Thr 






355 








Asp 


Leu 


Ala 


Glu 


Thr 


Gin 




370 










Tyr 


Thr 


Asn 


Lys 


Leu 


Glu 


385 










390 


Ala 


Tyr 


Thr 


Leu 


Ser 


Arg 



405 



Leu 


Pro 


Val 








Glu 


Lys 


Tyr 


Pro 


Asn 


Thr 








10 






Gly 


He 


His 


Val 


Ala 


Gly 






25 








Asp 


Asp 


Leu 


Asp 


Glu 


Pro 




40 










Phe 


Ser 


Asn 


Ala 


Leu 


Ala 


55 










60 


He 


His 


He 


Leu 


Gin 


Phe 










75 




Ala 


Lys 


Ala 


Lys 


Phe 


Glu 








90 






Ser 


Gin 


Pro 


Lys 


Arg 


Pro 






105 








Arg 


Leu 


Thr 


Lys 


Gly 


Asn 




120 










Gly 


Gly 


Trr> 


Met 


Lys 


Gly 


135 










140 


Gin 


Glu 


Glu 


Gly 


Ser 


Glu 










155 




Cys 


Glu 


Val 


Pro 


Asn 


Asn 








170 






Trp 


Lys 


Asp 


Ser 


Lys 


His 






185 








Gly 


Cys 


He 


Leu 


Arg 


Asn 




200 










Ala 


Gly 


Pro 


Asp 


Thr 


Lys 


215 










220 


Lys 


Arg 


Thr 


Ser 


He 


Asp 










235 




Met 


Leu 


He 


Ser 


Gin 


Pro 








250 






Tyr 


Phe 


He 


Asn 


Trp 


Asp 






265 








Pro 


Ala 


Val 


Ala 


Arg 


Thr 




280 










Glu 


Tyr 


He 


Phe 


Ser 


Asp 


295 










300 


Thr, Phe 


Lys 


Arg 


Cys 


Ser 










315 




Asp 


Asp 


Leu 


Asp 


Gin 


Lys 








330 






Leu 


Ala 


Arg 


Leu 


Lys 


Lys 






345 








He 


Lys 


Asn 


Asp 


Lys 


Gly 




360 










Thr 


Thr 


He 


Arg 


Glu 


Tyr 


375 










380 


Asn 


Leu 


Glu 


Glu 


Met 


Asp 










395 




Leu 


Asn 


Gin 


Glu 


Glu 


Val 
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Ser 


Val 


Ala 


Cys 






15 




Glv 


Lys 


Ser 


Gly 




30 






He 


Glu 


Asn 


Pro 


45 








lie 


TtAn 
JLtCSU 


L6U 


pro 


He 


Tyr 


Arg 


Ser 








80 


V3j. y 


rLLd. 


ai n 


Ser 






y z> 






Tyr 




Pro 




110 






Coy- 


Arg 


pro 




125 








Val 




Gin 


V3JL y 


Asn 


He 


Lys 


Glu 








160 


Lys 


Leu 


Asp 


Lys 






175 




Ser 


Leu 


Asn 


Asn 




190 






Thr 


Ser 


Tr~D 


Cvs 


205 








Leu 


Met 


Gin 


Asn 


Arg 


Leu 


Met 


Asn 








240 


Val 


Val 


Glu 


Phe 






255 




Arg 


Lys 


Met 


Tvr 
j 




270 






Thr 


Thr 


Leu 


Asn 


285 








Lys 


Thr 


Gly 


Thr 


He 


Asn 


Gly 


Arg 








320 


Thr 


Glu 


He 


Thr 






335 




Lys 


Lys 


Arg 


Glu 




350 






Asn 


He 


Thr 


Thr 


365 








Tyr 


Lys 


His 


Leu 


Lys 


Phe 


Leu 


Asp 








400 


Glu 


Ser 


Leu 


Ser 






415 
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Arg Pro 


He 


Thr Ser Ser Glu He 






420 


Thr Lys 


Lys 


Ser Pro Gly Pro Asp 




435 


440 


Lys Tyr 


Lys 


Glu Glu Leu Glu Lys 


450 




455 


Ser Gin 


Ala 


Asp Arg Glu Phe Gin 


465 




470 


Ser lie 


Lys 


Met Gly Asp Pro Lys 






485 


Ala Leu 


Cys 


His Thr Val Met Ser 



500 



Glu 


Ala 


Val 


He 


Asn Ser Leu Pro 


425 








430 


Arg 


Phe 


Thr 


Ala 


Glu Leu Tyr Gin 










445 


Glu 


Pro 


Val 


Asp 


Phe Ser Val Lys 








460 




Phe 


Phe 


Asp His Asn Leu Met Glu 






475 




480 


Val 


His 


Glu 


Phe 


Leu Arg Leu Leu 




490 






495 


Glu 


Glu 


Asn 


Ser 


Ala 


505 











<210> 310 
<211> 70 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<222> (1) . . . (70) 

<223> Xaa = X or * as defined in Table 6 



<400> 310 

Asp Trp Thr Val Gly Phe Val Gly Asn Ser Asp Thr Glu Leu Pro Gly 

1 5 10 15 

Ser Val Gly Arg Arg Ser Leu Trp Glu Ser Ser Tyr Ser Thr Arg Thr 

20 25 30 

Arg Asn Gin Gly Arg Gin Ala He Gin He Gin His Ser Xaa Leu Arg 

35 40 45 

Glu Val Glu Arg Lys Ser Gly Gin Lys Ala Thr Met Ser Ser Gly Gly 

50 55 60 

Gly Tyr Cys Gin Pro Glu 
65 70 



<210> 311 
<211> 250 
<212> PRT 

<213> Homo sapiens 



<400> 311 



Ala He 


Arg 


Gin 


Glu 


Lys Glu 


He 


1 






5 






Glu Val 


Lys 


Leu 


Ser 


Leu Phe 


Ala 






20 








Asn Pro 


He 


Val 


Ser 


Ala Gin 


Asn 




35 








40 


Ser Lys 


Val 


Ser Gly 


Tyr Lys 


He 


50 








55 




Leu Tyr 


Thr 


Asn 


Asn 


Arg Gin 


Thr 


65 








70 




Pro Phe 


Thr 


He 


Ala 


Ser Lys 


Arg 








85 






Thr Arg 


Asp 


Val 


Lys 


Asp Leu 


Phe 






100 








Lys Glu 


He 


Lys 


Glu 


Asp Arg 


Asn 




115 








12 0 


Trp Val 


Gly 


Arg He 


Asn Met 


Val 



Lys Gly He Gin Leu Gly Lys Glu 

10 15 
Asp Asp Met He Leu Tyr Leu Glu 

25 30 
Leu Leu Lys Leu He Ser Asn Phe 
45 

Asn Val Gin Lys Ser Gin Ala Phe 
60 

Glu Ser Gin He Met Ser Glu Leu 
75 80 
He Lys Tyr Leu Gly He Gin Leu 

90 95 
Lys Glu Asn Tyr Lys Leu Leu Leu 
105 110 
Lys Trp Lys Asn He Pro Cys Ser 
125 

Lys Met Ala He Leu Pro Lys Val 
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130 








13 5 




He 


Tyr 


Gly 


Phe Asn 


Ala 


lie 


Pro 


145 








150 






Thr 


Glu 


Leu 


Glu Lys 


Thr 


Thr 


Ser 








165 








Ala 


His 


He 


Ala Lys 


Ser 


He 


Leu 








180 








lie 


Thr 


Pro 


Pro Asp 


Phe 


Lys 


Leu 






195 








200 


Thr 


Ala 


Trp 


Thr Arg 


Lys 


He 


Tyr 




210 








215 




He 


Ser 


Val 


Glu Pro 


Val 


Tyr 


Ser 


225 








230 






Gin 


Leu 


Val 


Pro Leu 


Leu 


Cys 


Thr 



245 



140 



He 


Lys 


Leu 


Pro 


Met He 


Phe Phe 






155 






160 


Lys 


Phe 


He 


Trp 


Asn Gin 


Lys Arg 




170 








175 


Ser 


Gin 


Lys 


Asn 


Lys Ala 


Gly Gly 


185 








190 




Tyr 


Tyr 


Lys 


Ala 


Thr Val 


Thr Lys 










205 




Ser 


Ala 


Lys 


Lys 


Arg Lys 


Val Lys 








220 






Gly 


Val 


Thr 


Leu 


Thr Thr 


Ala He 






235 






240 


Ala 


Leu 











250 



<210> 312 
<211> 297 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (1) . . . (297) 

<223> Xaa » X or * as defined in Table 6 



<400> 312 



Met 


Asp 


Tyr 


Glu 


Lys 


Ala 


Asp 


Lys 


1 








5 








Ala 


Lys 


Ser 


Pro 
20 


Leu 


Gly 


Leu 


Val 


He 


Gin 


Glu 
35 


Ser 


He 


Val 


Ala 


Asn 
40 


Val 


Leu 


Pro 


Gly Pro 


Pro 


Glu 


Asp 




50 










55 




Pro 


Gin 


Thr 


Gin Lys 


Ala 


Tyr 


Met 


65 










70 






Asn 


Val 


Thr 


Ala 


Asn 
85 


Asn 


Asn 


Met 


Gin 


Leu 


Pro 


Tyr Ala 


Tyr 


Tyr 


Arg 








100 










He 


Met 


Ser 
115 


Met 


Glu 


Thr 


Arg 


His 
120 


Tyr 


Ser 


Phe 


Gly Asp 


Ser 


Gly 


Asn 




130 










135 




Pro 


Ala 


Ala 


Asp Ala 


Val 


Arg 


Gly 


145 










150 






Thr 


Ser 


He 


Pro 


Ser 
165 


Ser 


Gly 


He 


Glu 


Asn 


Lys 


Asn 
180 


Leu 


Glu 


His 


Leu 


Thr 


Thr 


Leu 
195 


Gin 


He 


Arg 


Pro 


Ala 
200 


Phe 


Leu 
210 


Ala 


Pro 


Val 


Lys 


Thr 
215 


Glu 


Ser 


Gly 


Pro 


Asp 


Cys 


Ala 


Gly 


Ser 


225 










230 






Tyr 


Gin 


Arg 


Ala 


Pro 
245 


Gin 


Met 


Pro 


Glu 


Glu 


Leu 


He 


Leu 


Arg 


Glu 


Lys 



260 



Arg 


Pro 


Thr 


Pro 


Trp Glu 


Ala Ala 




10 






15 


Asp 


Asp 


Ala 


Phe 


Gin Pro 


Lys Asn 


25 








30 




Val 


Val 


Ser 


Ala 


Ala Arg 


Arg Lys 










45 




Trp 


Asn 


Glu 


Arg 


Leu Ser 


Tyr He 








60 






Gly 


Ser 


Cys 


Gly 


Arg Gin 


Glu Tyr 






75 






80 


Ser 


Thr 


Thr 


Ser 


Gin Tyr 


Gly Ser 




90 








95 


Gin 


Ala 


Ser 


Arg 


Asn Asp 


Ser Ala 


105 








110 




Leu 


Tyr 


Thr 


Arg 


Gin Leu 


Tyr Cys 










125 




Phe 


Cys 


Glu 


Asn 


Thr Asn 


Gly Arg 








140 






Leu 


Thr 


He 


Leu 


Ser Leu 


Ser Thr 






155 






160 


Ser 


Glu 


Ala 


Leu 


He Ser 


Glu Asn 




170 








175 


Thr 


His 


Gly 


Gly 


Tyr Val 


Glu Ser 


185 








190 




Thr 


Lys 


Thr 


Gin 


Cys Thr 


Glu Phe 










205 




Val 


Pro 


Leu 


Ala 


Glu Asn 


Gin Arg 








220 






Leu 


Lys 


Glu 


Glu 


Thr Gly 


Pro Ser 






235 






240 


Asp 


Ser 


Gin 


Arg 


Gly Arg 


Val Ala 




250 








255 


Val 


Glu 


Ala 


Ser 


Thr Gin 


Asn Asn 


265 








270 
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Tyr Tyr Val Gly Glu Leu Thr Gly Val Thr Leu Gin Asn Gly Tyr Gly 

275 280 285 

Glu Lys Pro lie Leu Ala Thr Gin Xaa 
290 295 



<210> 313 
<211> 325 
<212> PRT 

<213> Homo sapiens 



<400> 313 



Met 


Leu 


Lvs 


Tvr 

* Jr "*• 


Thr 


Gly Ala 


H -5 s Gin Glu Val Glu 


ucu o ci Aid rlw 


1 








5 






10 


15 


lie 


Val 


Thr 


Lvs 


Met 


Ala 


Thr 


Gin Tvi" Leu Aro Glu 


■e%£9i* licit trilKl 








20 








25 


30 


Arg 


Phe 


Asp 


Asn 


Asp 


Asn 


Phe 


PVK Ti^U T\C*1 1 21 en fil v 
\— jr o ucu jjcu noil v_j_i_ y 


-f-^ts^J IXJLeL V Ct_L lie 






35 










40 


45 


Phe 


Arg 


Met 


Tvr 
j 


Val 


Ser Trp 


u y o uc u vai ui u. jjyo 


wjlu iii y ixii vjj_u 




50 










55 


60 




lie 


Met 


Leu 


Lys 


TVT 


Thr Gly 


Ala His Gin Glu Thr- 

je\-i-a. nio vJJ-ii V3J. n i IIJ_ 


ixp iicu Juy o iisp 


65 










70 




75 


ftfl 

O v 


Leu 


Glu 


Glu 


Ser 


Pro 


Leu Tyr 


Gl u TVl ^ Ti^n Qpr Mot* 

VJtJL Li AXtt 1ICU OCX. net 


.MXy JLy bill i-Lfap 










85 






90 


95 


Lys 


Glu 


Thr 


Leu 


Glv 


Leu Trp 


II^ Gin Leu Pro Tr"n 

ju J. c u j-ii jjwu riu up 


fves Drn Trn ml t J" 








100 








105 


110 


Lvs 


Ala 


Val 


Gin 


Met 


His 


Met 


A sti Pm fi^r* fipr Plie* 


ulil IicU >i.i>|J 1 III. 






115 










120 


125 


Lvs 


Pro 


Glv 


Lys 


Glv 


Glu 


Leu 


Ala Glv A~rcr Tif*u Tl 


Tip Dfrt TT-i cj nl Tl 

■IXC; tfJL\J Xllb 




130 










135 


140 




Glu 


Ala 


Ser 


He 


Leu 


Glu 


Leu 


Ser* Leu Leu Leu Mc*»t 


Thl" Pvq Val 


145 










150 




155 


160 


Glu 


Arg 


Glu 


Gly 


Lys 


Thr 


Ser 


Val Arg Val Ala Ala 


Val Gly Glu Cys 










165 






170 


175 


Thr 


Ala 


Ser 


Glu 


Thr 


Pro 


Asn 


Gin Gly Ala Gly Arg 


Leu Ser Leu Trp 








180 








185 


190 


Gin 


Gin 


Leu 


Thr 


Ser 


Lys 


Lys 


Glu Thr lie Met Glu 


Lys Glu His Thr 






195 










200 


205 


Asp 


Cys 


Val 


Ser 


Gin 


Thr 


Val 


Ala Leu lie Ser Thr 


Cys Val Lys Glu 




210 










215 


220 


Gly 


Gly 


Ser 


Arg 


Pro 


Ala Asp 


Lys Asp Leu Glu Glu 


Gly Gly Gly Leu 


225 










230 




235 


240 


Glu 


Ala 


Glu 


Ser 


Pro 


Lys 


Gin 


Ser Pro Asn Leu Cys 


Val lie Leu Arg 










245 






250 


255 


His 


Asn 


Leu 


Ala 


Ser 


Arg Pro 


Gly Gin Leu Ala Leu 


Val Thr Val Gly 








260 








265 


270 


Thr 


Met 


Gin 


Gly 


Arg 


Pro 


Leu 


Ser His Ser Ser Glu 


Val Lys Gly Thr 






275 










280 


285 


Thr 


Phe 


Val 


Thr 


His 


Ser 


Val 


Pro Ala Gly Lys Glu 


Lys Asp Glu Glu 




290 










295 


300 




Arg 


Gly 


lie 


Gly 


Asp 


Leu 


Glu 


His Ala Arg Asp Leu 


Arg Asn Ser Pro 


305 










310 




315 


320 


Thr 


Pro 


Leu 


Phe 


Tyr 











325 



<210> 314 
<211> 301 
<212> PRT 
<213> Homo sapiens 
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<400> 314 



Met 


Ser 


Glu 


Leu 


Pro 


Phe Thr He 


l 








5 




Gly 


He 


Gin 


Leu 


Thr 


Arg Asp Val 








20 






Lys 


Pro 


Leu 


Leu 


Asn 


Glu He Lys 






35 






40 


He 


Pro 


Cys 


Ser 


Trp 


Val Gly Arg 




50 








55 


Leu 


Pro 


Gin 


Arg 


Leu 


Pro His Gly 


65 










70 


Val 


Val 


Asp 


Lys 


Arg 


Asn Pro Arg 










85 




Asp 


Val 


Asp 


Asp 


Gin 


Arg Val Lys 








100 






Ser 


Gly 


Val 


Ser 


Ala 


Asp Leu Asn 






115 






120 


Leu 


Lys 


Thr 


Ala 


Leu 


Ala Val Ser 




130 








135 


Val 


Ser 


Gly 


Leu 


He 


Phe Ser Ser 


145 










150 


Gly 


He 


Met 


Arg 


Glu 


Asp Leu Gly 










165 




Phe 


Pro 


Met 


Ala 


Leu 


Ser Lys Ser 








180 






Gin 


Leu 


Asp 


Ala 


Leu 


Ser Gin Asp 






195 






200 


Ser 


Lys 


Lys 


Gly Thr 


Gly Tyr Pro 




210 








215 


Glu 


Tyr 


Leu 


Glu 


Ala 


Thr Gin Thr 


225 










230 


Lys 


Met 


Asp 


Ser 


Asp 


Val Gly Glu 










245 




Arg 


Pro 


Asn 


Tyr 


Ser 


Gin Tyr Thr 








260 






Ser 


Tyr 


Lys 


Glu 


Asn 


Val Phe Leu 






275 






280 


Asp 


He 


Ala 


Glu Arg 


Phe Asp Cys 




290 








295 



Ala 


Ser Lys Arg He 


Lys 


Tyr 


Leu 




10 




15 




Lys 


Asp Phe Phe Lys 


Glu Asn 


Tyr 


25 




30 






Glu 


Glu Thr Asn Lys 


Trp Lys Asn 




45 








He 


Asn He Val Lys 


Met 


Ala 


lie 




60 








Phe 


Leu Pro Asn Met 


Lys 


Leu 


Glu 




75 






80 


Leu 


He Arg Val Ala 


Thr 


He 


Val 




90 




95 




His 


Ser Met Thr Ala 


Ser Ser Gly 


105 




110 






Thr 


Ala Ser Gin Pro 


Leu Trp 


Leu 




125 








Ser 


Ser Val Lys Val 


His 


Pro 


Pro 




140 








Ser 


Arg Thr Leu Leu 


Ser 


Phe 


Met 




155 






160 


Phe 


Ser Arg Arg Gin 


He 


Leu 


His 




170 




175 




Ala 


Gly Arg Arg Ser 


Lys 


He Gly 


185 




190 






Phe 


Gly Leu Arg Asp 


Arg 


Asp 


Ser 




205 








Asn 


Pro Glu Asn Phe 


Ser 


Trp 


Thr 




220 








Asn 


Ala Val Pro Ala 


Lys 


Val 


Phe 




235 






240 


Asn 


Arg Lys He Leu 


Arg Asp 


Glu 




250 




255 




Pro 


Phe Ser Arg Cys 


Asp 


Asn 


Ala 


265 




270 






Gin 


Lys Leu Glu Arg 


Asn 


Thr 


Pro 




285 








Leu 


Leu Leu Thr Tyr 









300 



<210> 315 
<211> 80 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (1) . . . (80) 

<223> Xaa = X or * as defined in Table 6 



c400> 315 

Met Met Ser Trp Ser Ser Ala Glu Lys Gly Pro Glu Gly His Arg Arg 

15 10 15 

Arg Glu Trp Pro Ser Gin Trp Glu Asp Glu Pro Xaa A3n Gin Asn Gly 

20 25 30 

Glu Ser Arg Lys Lys Glu Arg Lys Glu Lys Arg Arg Lys Glu Thr Xaa 

35 40 45 

Glu Glu Arg Arg Gly Glu Lys Arg Glu Lys Arg Glu Lys Lys Arg Arg 

50 55 60 

Arg Glu Gly Arg Met Ser He Ser Phe Ala Arg Arg Tyr Ser He Leu 



208 



WO 01/74836 

65 



70 



75 



PCT/US01/10472 

80 



<210> 316 
<211> 106 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (1) . - . (106) 

<223> Xaa = X or * as defined in Table 6 



<400> 316 



Ser 


Gin 


Arg 


Thr 


Ala Gly Asn Pro 


Cys Leu His Pro Val Ser Leu Cys 


1 








5 




10 15 


Gly 


Ser 


Ala 


Ser 


Trp 


Met Pro Met 


lie Met Pro Gin Arg Trp Ser Ser 








20 






2 5 30 


Leu 


Cys 


Ser 


Ala 


Met 


Glu Lys Pro 


Ala Ser Pro Cys Leu Xaa Met Pro 






35 






40 


45 


Pro 


Gin 


Ala 


Thr 


Cys 


Trp Cys Pro 


Ser Arg Leu Pro Met Ala Trp Ala 




50 








55 


60 


Ser 


Gly 


His 


xaa 


His 


Thr Ser Thr 


Gly His Ser Gin Leu Pro Ala He 


65 










70 


75 80 


Pro 


Phe 


Asp 


Asn 


His 


Cys Gly Lys 


Arg Cys Arg Leu Gly Gly Lys Trp 










85 




90 95 


Arg 


Ala 


Pro 


Leu 


Gin 


His Pro Gin 


Trp Lys 








100 






105 



<210> 317 
<211> 89 
<212> PRT 

<213> Homo sapiens 



<400> 317 



Arg 


Arg 


Pro Thr 


Arg 


Pro Gin Glu 


Glu Gly Gly Ser Glu Ser Ser Thr 


1 






5 




10 15 


Met 


Thr 


Glu Leu 


Glu 


Thr Ala Met 


Gly Leu He He Asp Val Phe Ser 






20 






25 30 


Arg 


Tyr 


Ser Gly 


Ser 


Glu Gly Ser 


Thr Gin Thr Leu Thr Lys Gly Glu 






35 




40 


45 


Leu 


Lys 


Val Leu 


Met 


Glu Lys Glu 


Leu Pro Gly Phe Leu Gin Leu Ser 




50 






55 


60 


Gly 


Pro 


Gly Leu 


Gly 


His Gin His 


Thr Leu Leu Leu Leu Phe Arg Ser 


65 








70 


75 80 


Ala 


Ser 


Trp Ser 


Arg 


Leu Val Pro 


Gin 








85 







<210> 318 
<211> 151 
<212> PRT 

<213> Homo sapiens 



<400> 318 
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Met 


Val 


Glu 


Gly Lys 


Glu Glu Gin 


1 






5 




Arg Ala 


Cys 


Ala Gly 


lie Arg Gly 








20 




Val 


Ala 


Arg 


Val His 


He Gly Gin 






35 




40 


Leu 


Gin 


Asn 


Lys Glu 


His Val He 




50 






55 


Lys 


Phe 


Pro 


Gly Arg 


Gin Asn He 


65 








70 


Thr 


Lys 


Phe 


Ser Val 


Asp Glu Phe 








85 




Leu 


lie 


Pro 


Asp Asn 


Cys Gly Val 








100 




Pro Asp 


Lys 


Arg Asp 


Gly Val Ala 






115 




120 


Gin 


Leu 


Leu 


Gin Asn 


Lys He Arg 




130 






135 


Pro 


Pro 


Lys 


Lys Ala 


Arg Arg 


145 








150 



Val Thr Ser Tyr Val Asp Val Gin 

10 15 
Ala Phe Glu Lys Pro Gin Gly Ala 

25 30 
Val He Met Ser He Cys Thr Lys 
45 

Glu Ala Leu Cys Lys Ala Asn Phe 
60 

His Phe Ser Glu Lys Trp Asp Phe 
75 *" 80 

Glu Asp Met Met Ala Glu Lys Gin 

90 95 
Lys Tyr Thr Pro Asn Arg Asp Pro 
105 no 
Leu Gin His Gly Leu Leu Leu Trp 
125 

Leu His Gin Gly Arg Glu Lys Lys 
140 



<210> 319 
<211> 124 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<222> (1) . . . (124) 

<223> Xaa = X or * as defined in Table 6 



<400> 319 



Met 


Ser 


Arg Arg 


Lys 


Gin Gly Lys 


1 






5 




Phe 


Ser 


Pro Arg 


Asp 


Arg Glu Glu 






20 






Pro 


Pro 


Thr Pro 


Pro 


Gly Leu Val 






35 




40 


Arg Leu 


Gly Gin Pro 


Cys Ser Ala 




50 






55 


Asp Tyr 


Asp Pro 


Pro 


Glu Pro He 


65 








70 


His 


Thr 


His Ser 


His 


Ser Val Ser 








85 




Pro 


Gin 


Pro Leu 


Lys 


His Leu Gly 






100 






Val 


Arg 


Gly Met Met 


Gly Met Tyr 






115 




120 



Pro 


Gin 


His Leu Ser Lys Arg Glu 




10 


15 


Val 


Thr 


Thr Cys Phe Pro Cys Pro 


25 




30 


Thr 


Ser 


Pro Pro Ala Pro Arg Ala 






45 


Arg 


Asn 


Glu Asn Leu Leu Glu Ala 






60 


Val 


Leu 


Arg Asn Thr Thr Ala Thr 






75 80 


Pro 


Ser 


Leu Tyr Asn Ser Asp Ser 




90 


95 


Ala 


Val 


Ser Ala Ala Glu Thr Gly 


105 




110 


Leu 


Lys 


Pro Xaa 



<210> 320 

<211> 1067 

<212> PRT 

<213> Homo sapiens 



<400> 320 

Met Cys Glu Leu Asp He Leu His Asp Ser Leu Tyr Gin Phe Cys Pro 

15 io 15 

Glu Leu His Leu Lys Arg Leu Asn Ser Leu Thr Leu Ala Cys His Ala 
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20 








25 










30 




Leu 


Leu 


Asp 


Cys 


Lys 


Thr Leu 


Thr 


Leu 


Thr 


Glu 


Leu 


Gly Arg 


Asn Leu 






35 








40 










45 






Pro 


Thr 


Lys 


Ala 


Arg 


Thr Lys 


His 


Asn 


He 


Lys 


Arg 


He 


Asp 


Arg Leu 




50 








55 










60 








Leu 


Gly 


Asn 


Arg 


His 


Leu His 


Lys 


Glu 


Arg 


Leu 


Ala 


Val 


Tyr 


Arg Trp 


65 










70 








75 








80 


His 


Ala 


Ser 


Phe 


He 


Cys Ser 


Gly 


Asn 


Thr 


Met 


Pro 


He 


Val 


Leu Val 










85 








90 










95 


Asp 


Trp 


Ser 


Asp 


He 


Arg Glu 


Gin 


Lys 


Arg 


Leu 


Met 


Val 


Leu 


Arg Ala 








100 








105 










110 




Ser 


Val 


Ala 


Leu 


His 


Gly Arg 


Ser 


Val 


Thr 


Leu Tyr 


Glu 


Lys 


Ala Phe 






115 








120 










125 






Pro 


Leu 


Ser 


Glu 


Gin 


Cys Ser 


Lys 


Lys 


Ala 


His Asp 


Gin 


Phe 


Leu Ala 




130 








135 










140 








Asp 


Leu 


Ala 


Ser 


He 


Leu Pro 


Ser 


Asn 


Thr 


Thr 


Pro 


Leu 


He 


Val Ser 


145 










150 








155 








160 


Asp 


Ala 


Gly 


Phe 


Lys 


Val Pro 


Trp 


Tyr 


Lys 


Ser 


Val 


Glu 


Lys 


Leu Gly 










165 








170 










175 


Trp 


Tyr 


Trp 


Leu 


Ser 


Arg Val 


Arg 


Gly 


Lys 


Val 


Gin 


Tyr Ala 


Asp Leu 








180 








185 










190 




Gly 


Ala 


Glu 


Asn 


Trp 


Lys Pro 


He 


Ser 


Asn 


Leu 


His 


Asp Met 


Ser Ser 






195 








200 










205 






Ser 


His 


Ser 


Lys 


Thr 


Leu Gly 


Tyr 


Lys 


Arg 


Leu 


Thr 


Lys 


Ser 


Asn Pro 




210 








215 










220 








He 


Ser 


Cys 


Gin 


He 


Leu Leu 


Tyr 


Lys 


Ser 


Ai~cr 


Ser 


Lys Gly 


Arg Lys 


225 










230 








235 








240 


Asn 


Gin 


Arg 


Ser 


Thr 


Arg Thr 


His 


Cys 


His 


His 


Pro 


Ser 


Pro 


Lys lie 










245 








250 










255 


Tyr 


Ser 


Ala 


Ser 


Ala 


Lys Glu 


Pro 


Trp 


He 


Leu 


Ala 


Thr 


Asn 


Leu Pro 








260 








265 










270 




Val 


Glu 


lie 


Arg 


Thr 


Pro Lys 


Gin 


Leu 


Val 


Asn 


He 


Tyr 


Ser 


Lys Arg 






275 








280 










285 






Met 


Gin 


lie 


Glu 


Glu 


Thr Phe 


Arg 


Asp 


Leu 


Lys 


Ser 


Pro 


Ala 


Tvr Glv 




290 








295 










300 








Leu 


Gly 


Leu 


Arg 


His 


Ser Arg 


Thr 


Ser 


Ser 


Ser 


Glu 


Arg 


Phe 


Asp He 


305 










310 








315 








320 


Met 


Leu 


Leu 


He 


Ala 


Leu Met 


Leu 


Gin 


Leu 


Thr 


Cys 


Trp 


Leu 


Ala Gly 










325 








330 










335 


Val 


His 


Ala 


Gin 


Lys 


Gin Gly 


Trp 


Asp 


Lys 


His 


Phe 


Gin 


Ala 


Asn Thr 








340 








345 










350 




Val 


Arg 


Asn 


Arg 


Asn 


Leu Lys 


He 


Tyr 


Ser 


His 


Met 


Val 


Thr 


Leu Trp 






355 








360 










365 






Gly 


Asn 


Tyr 


Glu 


Gly 


He Ser 


Gin 


Thr 


Gin 


Ala 


Phe 


Ala Lys 


Glu Asn 




370 








375 










380 








Asn 


Gin 


Lys 


Ala 


Tyr 


Lys Glu 


Thr 


Tyr 


Gly 


Val 


Ser 


His 


He 


Thr Arg 


385 










390 








395 








400 


His 


Asp 


Met 


Leu 


Gin 


He Pro 


Lys 


Gin 


Gin 


Gin 


Asn 


Glu 


Lys 


Tyr Gin 










405 








410 










415 


Val 


Pro 


Gin 


Phe 


Asp 


Gin Ser 


Thr 


He 


Lys 


Asn 


He 


Glu 


Ser 


Ala Lys 








420 








425 










430 




Gly 


Leu 


Asp 


val 


Trp 


Asp Ser 


Trp 


Pro 


Leu 


Gin 


Asn 


Ala Asp 


Gly Thr 






435 








440 










445 






Val 


Ala 


Glu 


Tyr 


Asn 


Gly Tyr 


His 


Val 


Val 


Phe 


Ala 


Leu 


Ala 


Gly Ser 




450 








455 










460 








Pro 


Lys 


Asp 


Ala 


Asp 


Asp Thr 


Ser 


He 


Tyr 


Met 


Phe 


Tyr 


Gin 


Lys Val 


465 










470 








475 








480 


Gly 


Asp 


Asn 


Ser 


He 


Asp Ser 


Trp 


Lys 


Asn 


Ala Gly 


Arg Val 


Phe Lys 










485 








490 










495 


Asp 


Ser 


Asp 


Lys 


Phe 


Asp Ala 


Asn 


Asp 


Pro 


He 


Leu 


Lys Asp 


Gin Thr 








500 








505 










510 




Gin 


Glu 


Trp 


Ser 


Gly 


Ser Ala 


Thr 


Phe 


Thr 


Ser Asp 


Gly Lys 


He Arg 






515 








520 










525 






Leu. 


Phe 


Tyr 


Thr 


Asp 


Tyr Ser 


Gly 


Lys 


His 


Tyr Gly 


Lys 


Gin 


Ser Leu 
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530 










535 




Thr 


Thr 


Ala 


Gin 


Lys 


Ala 


Tyr 


Arg 


545 










550 






Gin 


Lys 


Asn 


Gly 


Ala 


Ala 


Asp 


Ala 










565 








Trp 


Ala 


Leu 


Gly 


His 


Gly 


Asp 


Asp 








580 










Ser 


Ser 


Gly 


Trp 


Pro 


Val 


Gly 


Phe 






595 










600 


He 


Leu 


Arg 


Ser 


Gin 


Val 


Lys 


Phe 




610 










615 




Cys 


Leu 


Gly 


Arg 


Gly 


Ser 


Leu 


Met 


625 










630 






Asn 


Gly 


Ser 


Phe 


He 


Phe 


Val 


Leu 










645 








Asp 


Thr 


Tyr 


Asn 


Ser 


Ser 


Asp 


Gin 








660 










He 


Glu 


Phe 


Gly 


Gly 


Ser 


Glu 


Leu 






675 










680 


Gly 


His 


Gin 


Lys 


Gly 


Leu 


Val 


His 




690 










695 




Ser 


Gly 


Ser 


Asp 


Pro 


Gin 


His 


Ala 


705 










710 






Arg 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 










725 








Leu 


Ser 


Leu 


Pro 


Val 


Pro 


Thr 


Ser 








740 










Tyr 


Pro 


Leu 


Leu 


Ala 


Gin 


Cvs 


Leu 






755 










760 


Gly 


Ser 


Gly 


Cys 


Gly 


Gin 


Glu 


Glv 




770 










775 




Ala 


Ala 


Gly 


Val 


Arcr 


Glu 


Gin 


Leu 


785 










790 






Pro 


Gly 


Asp 


Leu 


Pro 


Gly 


Gly 


Lys 










805 








Glu 


Gly 


Pro 


Gly 


Val 


Gly 


Ala 


Glu 








820 










Pro 


Ser 


Ser 


Pro 


Leu 


Pro 


Trp 


Leu 






835 










840 


Gin 


Phe 


Ala 


Glu 


He 


Gin 


Gly 


Ala 




850 










855 




Asp 


Ala 


Glu 


Arg 


Gly 


Arg 


Ala 


Lys 


865 










870 






Gly 


Arg 


Lys 


He 


Phe 


Asp 


Pro 


Asp 










885 








Ser 


Lys 


Val 


Leu 


Ser 


Thr 


Leu 


Leu 








900 










Gin 


Leu 


Ser 


Glu 


Arg 


Pro 


Cys 


Gly 






915 










320 


Asn 


Thr 


Ser 


Gin 


Thr 


Asn 


Pro 


Gin 




930 










935 




Gly 


Leu 


Gin 


Arg 


Gin 


Ser 


Lys 


Thr 


945 










950 






His 


Gin 


Leu 


He 


Val 


Lys 


Ala 


Arg 










965 








Asp 


Glu 


Leu 


Ser 


Val 


Cys 


Lys 


Gly 








980 










Glu 


Glu 


Gly 


Gly 


Trp 


Trp 


Glu 


Gly 






995 








1000 


Phe 


Pro 


Ser 


Asn 


Tyr 


Val 


Arg 


Glu 


1010 








1015 




Ser 


Pro 


Lys 


Ala 


Val 


Lys Gly Phe 


L025 








1030 






Tyr Tyr 


Thr 


Val 


Met 


Ser Arg 


Ser 



540 



Leu 


Glu 


He 


Val 


Ser 


Leu 


Glu 


Met 






555 










560 


Ala 


Pro 


Tyr 


Arg 


Gin 


He 


Glu 


Tyr 




570 










575 




He 


Lys 


Lys 


Ala 


Val 


Ala 


Phe 


Trp 


585 










590 






Ser 


Lys 


Met 


Glu 


Lys 


Ala 


Gly 


Lys 










605 








Pro 


Glu 


Tyr 


Met 


Glu 


Glu 


Ser 


Ser 








620 










Ser 


Leu 


Asn 


Asn 


Thr 


Ser 


Ser 


Ser 






635 










640 


Pro 


Leu 


Lys 


Leu 


Leu 


Arg 


Val 


Gly 




650 










655 




Ser 


Arg 


Met 


Ala 


Trp 


Arg 


Leu 


Thr 


665 










670 






His 


Leu 


Gly 


Val 


Arq 


Glu 


Glu 


Ala 










685 








Glu 


Ser 


Gly 


Asn 


Pro 


Ala 


Arcr 
•"-•-3 


Ser 








700 










Arg 


His 


Arg 


Gin 


Pro 


Ser 


Ala 


Thr 






715 










720 


Ala 


Ala 


Ala 


Ala 


Leu 


Pro 


Ala 


Pro 




730 










735 




Ala 


He 


Gin 


Val 


Arg 


Val 


Thr 


Ala 


745 










750 






Gin 


Ala 


Ala 


Phe 


Pro 


Pro 


Leu 


Leu 










765 








Thr 


Glv 


Ala 


Glv 


Glv 


Glv 


Glv 


Gly 








780 










Glu 


Asp 


Arq 


Arg 


Ala 


Ala 


Ala 


Glu 






795 










800 


Arcr 


Val 


Arc? 


Gly 


Arcr 


Gly 


Ala 


Arcr 




810 










815 




Gly 


Pro 


Pro 


Leu 


Glu 


Arcr 


Asn 


Arcr 


825 










830 






Ala 


Ala 


Pro 


Ala 


Ala 


Gly 
«* 


Ala 


Ser 










845 








Gly 


Lys 


Gly 


Glu 


Met 


Arcr 


Ala 


Lys 








860 










Leu 


Arg 


Gly 


Glu 


Leu 


Ser 


Ser 


Ser 






875 










880 


Asp 


Leu 


Tyr 


Ser 


Gly 


Val 


Asn 


Phe 




890 










895 




Ala 


Val 


Asn 


Lys 


Ala 


Thr 


Glu 


Asp 


905 










910 






Arg 


Ser 


Ser 


Ser 


Leu 


Ser 


Ala 


Ala 










925 








Gly 


Ala 


Val 


Ser 


Ser 


Thr 


Val 


Ser 








940 










Val 


Glu 


Met 


Thr 


Glu 


Asn 


Gly 


Ser 






955 










960 


Phe 


Asn 


Phe 


Lys 


Gin 


Thr 


Asn 


Glu 




970 










975 




Asp 


He 


He 


Tyr 


Val 


Thr 


Arg 


Val 


985 










990 






Thr 


Leu 


Asn 


Gly 


Arg 


Thr 


Gly 


Trp 








1005 








He 


Lys 


Ser 


Ser 


Glu 


Arg 


Pro 


Leu 






1020 










Glu 


Thr 


Ala 


Pro 


Leu 


Thr 


Lys 


Asn 




1035 








1040 


Leu 


Thr 


Ser 


Thr 


Val 


Leu 


Lys 


Asn 
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1045 1050 1055 

Ser Lys Val Ala Arg lie His Ser Lys Pro Tyr 
1060 1065 



<210> 321 
<211> 191 
<212> PRT 

<213> Homo sapiens 





<400> 321 










Arq 


Ser 


Pro 


Thr 


Leu 


Ser Ser Pro Pro Pro Ala Ser 


Lys Ala 


Gin Ala 


1 








5 


10 




15 


Leu 


Ala 


Leu 


Arg 


Ser 


Glu Ala Gin Ala Gin Met Pro 


jtva. y lie ui 


Pro Ala 








20 




25 


30 




Pro 


Arg 


Val 


Arg 


Arg 


Ser Ser Ala Ala Ala Ser Ala 




Arg o6r 






35 






40 


45 




Leu 


Ala 


Glu 


Thr 


Phe 


Ser Gly Lys Glu Cys Gin Trp 


Thr Asp 


Ala Cys 




50 








55 60 






Leu 


Ser 


His 


Pro 


Cys 


Ala Asn Gly Ser Thr Cys Thr 


Thr Val 


Ala Asn 


65 










70 75 




80 


Gin 


Phe 


Ser 


Cys 


Lys Cys Leu Thr Gly Phe Thr Gly 


Gin Lys 


Cys Glu 










85 


90 




95 


Thr 


Asp 


Val 


Asn 


Glu Cys Asp lie Pro Gly His Cys 


Gin His 


Gly Gly 








100 




105 


110 




Thr 


Cys 


Leu 


Asn 


Leu 


Pro Gly Ser Tyr Gin Cys Gin 


Cys Leu 


Gin Gly 






115 






120 


125 




Phe 


Thr 


Gly 


Gin 


Tyr 


Cys Asp Ser Leu Tyr Val Pro 


Cys Ala 


Pro Ser 




130 








135 140 






Pro 


Cys 


Val 


Asn 


Gly Gly Thr Cys Arg Gin Thr Gly 


Asp Phe 


Thr Phe 


145 










150 155 




160 


Glu 


Cys 


Asn 


Cys 


Leu 


Pro Glu Thr Val Arg Arg Gly 


Thr Glu 


Leu Trp 










165 


170 




175 


Glu 


Arg 


Asp 


Arg 


Glu 


Val Trp Asn Gly Lys Glu Pro 


Asp Glu 


Asn 








180 




185 


190 





<210> 322 
<211> 71 
<212> PRT 

<213> Homo sapiens 



<400> 322 



Met 


Pro Gly Pro 


Ala Ala Ala Ser His Arg Ala Ser 


Thr Tyr 


Val Ser 


1 




5 


10 




15 


Thr 


Trp Ser Cys 


Pro 


Pro His His Ser Trp His Ala 


Trp Gin 


Cys Thr 




20 




25 


30 




Val 


Ala Arg Pro 


His 


Leu Gin Thr Ser His Cys Cys 


Thr Ser 


Gly Leu 




35 




40 


45 




Pro 


Leu Ala Asp 


Met 


Glu Ser Arg Leu Val Ala Ser 


Pro Ser 


Glu Trp 




50 




55 60 






Asn 


Lys Leu Thr 


Trp 


Ala Gin 






65 






70 







<210> 323 
<211> 72 
<212> PRT 

<213> Homo sapiens 
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<400> 323 






Glu 


Ser Gin Ser 


Leu Glu Thr Gly 


1 




5 




Arg 


Lys Pro Gly 


Gly 


Pro val Leu 




20 






Ala 


Trp Ala Ser 


Arg 


Ser Pro Ala 




35 




40 


Pro 


Asn Leu Cys 


Cys 


Ser Pro Thr 




50 




55 


Asp 


Gly Pro Phe 


Val 


lie Arg Asn 


65 






70 



Leu Arg Ala Leu lie Trp Ser Thr 

10 15 
Gly Gly Leu Val Leu lie Lys Trp 

25 30 
Ser Pro Ser Asp Pro ser Pro Gly 
45 

Ser Pro Ala Thr Lys Pro Arg Val 
60 



<210> 324 
<211> 205 
<212> PRT 

<213> Homo sapiens 





<400> 324 






Met 


Asn 


Trp 


Val 


Leu Gin Lys Phe 


1 








5 




Tyr 


Arg 


Lys 


Ser 


Ser 


Asn Ser Ala 








20 






lie 


Glu 


Leu 


Asp 


Ala Gin Arg Pro 






35 






40 


Ala 


Ser 


Gly 


Ala 


His 


Pro Lys Lys 




50 








55 


Glu 


Glu 


Pro 


Ser 


Asp 


Gin Gin Ser 


65 










70 


Phe 


Gin 


Pro 


Lys 


Asp 


Gly Glu He 










85 




Val 


Asp 


Tyr 


Leu 


Val 


Lys Glu Gin 








100 






Ser 


Thr 


Leu 


Gly 


Met 


Ser Leu His 






115 






120 


Tyr 


He 


Leu 


Pro 


Gly Tyr Gin Pro 




130 








135 


Gin 


Val 


Asn 


He 


Asp 


Phe Gin Thr 


145 










150 


Gin 


Glu 


Pro 


Ser 


Leu 


Thr Cys Phe 










165 




Ser 


Leu 


Met 


Glu 


Lys 


Asp Ser Tyr 








180 






Tyr 


Leu 


Asp 


Leu 


Leu 


Ser Gin Ser 






195 






200 



He 


Thr Ala Trp Lys Phe 


Met Gly 




10 


15 


Arg 


Gly Ser Thr He Lys 


Glu His 


25 


30 




Val 


Arg Arg Ser Gly Pro 


He Gin 




45 




Gly 


Arg Gly Val Ser Cys 


Ser Val 




60 




Pro 


Ser Pro Pro Ser Pro 


Leu Thr 




75 


80 


Asn 


Phe Ser Val He Gly 


Gin Tyr 




90 


95 


Gly 


Val Lys Asn He Phe 


Gly Lys 


105 


110 




Val 


Ser Ser Ser Val Phe 


Arg Arg 




125 




Arg 


Gly His Thr Val Met 


Val Ser 




140 




Arg 


Glu Ala Thr Arg Lys 


Asn Leu 




155 


160 


Asp 


Gin Ala Gin Gly Lys 


Val His 




170 


175 


Pro 


Arg Phe Leu Arg Ser 


Lys Met 


185 


190 




Gin 


Arg Arg Leu Ser 





205 



<210> 325 

<211> 222 

<212> PRT 

<213> Homo sapiens 



<400> 325 
Met Val Met Ser Phe Val Lys Pro 

1 5 
Lys Lys Arg Asp Gly Glu Phe Asn 
20 

Pro Ala Arg Asp Thr Lys Arg Lys 



Gly Val Lys Glu Lys Glu Gin Val 

10 15 
Ser Glu His Ala Glu Leu Asp Val 

25 30 
Phe Trp Glu Pro Thr Arg Leu Ser 
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35 










40 




Ser 


Thr 


Leu 


Arg 


Thr 


Ser 


Ser 


Asp 


Pro 




50 










55 






lie 


Thr 


Met 


Val 


Cys 


Glu 


Pro 


Gly 


Ser 


65 










70 








Leu 


Thr 


Ala 


Gly 


Gly Ala 


Asn 


Val 


Trp 










85 










Lys 


Ser 


Arg 


Gin 


Leu 


Val 


Leu 


Arg 


Asn 








100 










105 


Pro 


Cys 


Gly 


Glu 


Leu 


Leu 


Pro 


He 


Leu 






115 










120 




Leu 


Leu 


Leu 


Gin 


Pro 


Val 


Asp 


Thr 


Glu 




130 










135 






Leu 


Gly 


Leu 


Asp 


Ala 


Glu 


Gly 


Pro 


Glu 


14 5 










150 








Leu 


Pro 


Cys 


Pro 


Gly He 


Asp 


His 


Thr 










165 










Asp 


Leu 


Tyr 


His 


Arg Lys 


Gly 


Asn 


Ser 








180 










185 


Thr 


Lye 


Asp 


Ala 


Arg 


Lys 


Glu 


Ser 


Phe 






195 










200 




Pro 


Leu 


Gin 


Ser 


Leu 


Ala 


Val 


He 


Tyr 




210 










215 








<210> 326 














<2ll> 680 














<212> PRT 














<213> Homo 


sapiens 










<400> 326 












Met 


Glu 


Glu 


His 


Ser 


Met 


Leu 


Met 


Gly 


1 








5 








Asn 


Gly 


Arg 


He 


Ala 


Gin 


Glu 


Leu 


Glu 








20 










25 


Tip 


Asn 


Gin 


Lys 


Arg 


Ala 


Cys 


He 


Thr 






35 










40 




Asn 


Lys 


Ala 


Gly 


Gly 


He 


Thr 


Leu 


Pro 




50 










55 






Ala 


Thr 


Val 


Thr 


Lys 


Thr 


Ala 


Trp 


Tyr 


65 










70 








Asp 


Gin 


Trp 


Asn 


Arg 


Thr 


Glu 


Pro 


Ser 










85 










Asn 


Tyr 


Leu 


He 


Phe 


Asp 


Lys 


Pro 


Glu 








100 










105 


Asp 


Ser 


Leu 


Phe 


Asn 


Lys 


Arg 


Phe 


Trp 






115 










120 




Arg 


Lys 


Leu 


Lys 


Leu 


Asp 


Pro 


Phe 


Leu 




130 










135 






Ser 


Arg 


Trp 


He 


Lys 


Asp 


Leu 


His 


Val 


145 










150 








Leu 


Glu 


Glu 


Asn 


Pro 


Gly 


lie 


Thr 


He 










165 










Asp 


Phe 


Thr 


Ser 


Lys 


Thr 


Pro 


Lys 


Ala 








180 










185 


Asp 


Lys 


Trp 


Asp 


Leu 


He 


Lys 


Leu 


Lys 






195 










200 




Thr 


Thr 


lie 


Arg 


Val 


Asn 


Arg 


Gin 


Pro 




210 










215 






Ala 


Thr 


Tyr 


Ser 


Ser 


Asp 


Lys 


Gly 


Leu 


225 










230 








Leu 


Lys 


Gin 


He 


His 


Lys 


Lys 


Lys 


Thr 
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45 



Leu 


Phe 


Ser 


Val 


Pro 


He 


Ser 






60 










Lys 


Ser 


Leu 


Gin 


Ser 


Cys 


Cys 




75 










80 


Glu 


Lys 


Ser 


Thr 


Cys 


Arg 


Lys 


90 










95 




Val 


Lys 


Val 


Pro 


Gly 


Lys 


Ser 










110 






Lys 


Lys 


Asn 


Gin 


Leu 


Asn 


lie 








125 








Thr 


Leu 


Glu 


Gly 


Pro- 


Pro 


Gly 






140 










Lys 


Arg 


His 


Ser 


Trp 


He 


Leu 




155 










160 


Ser 


Gly 


Leu 


Glu 


Val 


Met 


Ser 


170 










175 




Leu 


His 


Pro 


Gin 


Gly 


Lys 


Arg 










190 






Pro 


Gin 


Lys 


Met 


Gly 


Gin 


Phe 








205 








Pro 


Glu 


Ala 


Gly 


Thr 










220 










Arg 


Lys 


Asn 


Gin 


Tyr 


Arg 


Glu 


10 










15 




Lys 


Thr 


Thr 


Leu 


Lys 


Phe 


He 










30 






Lys 


Ser 


Asn 


Leu 


Ser 


Gin 


Lys 








45 








Asp 


Phe 


Lys 


Leu 


Tyr 


Tyr 


Lys 






60 










Trp 


Tyr 


Gin 


Asn 


Arg 


Asp 


He 




75 










80 


Glu 


He 


Met 


Pro 


His 


He 


Tyr 


90 










95 




Lys 


Asn 


Lys 


Gin 


Trp 


Gly 


Lys 










110 






Glu 


Asn 


Trp 


Leu 


Ala 


He 


Phe 








125 








Thr 


Pro 


Tyr 


Thr 


Lys 


He 


Asn 






140 










Arg 


Pro 


Lys 


Thr 


He 


Lys 


Thr 




155 










160 


Gin 


Asp 


Thr 


Gly 


Met 


Gly 


Lys 


170 










175 




Met 


Ala 


Thr 


Lys 


Ala 


Lys 


He 










190 






Ser 


Phe 


Cys 


Thr 


Ala 


Lys 


Glu 








205 








Thr 


Lys 


Trp 


Glu 


Lys 


He 


Phe 






220 










Thr 


Ser 


Arg 


He 


Tyr 


Asn 


Glu 




235 










240 


Asn 


Asn 


Pro 


He 


Arg 


Lys 


Trp 
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Ala 


Lys 


Asp Met 






260 


Lys 


Lys 


His Met 






275 


Gin 


He 


Lys Thr 




290 




He 


He 


Gin Lys 


305 






Ala 


Leu 


Ala Ala 


Leu 


Leu 


Gly Asp 






340 


Ala 


Ser 


Val Phe 






355 


Cys 


He 


Gly Pro 




370 




Pro 


Pro 


Glu Ala 


3B5 






Leu 


Lys 


Lys Phe 


Lys 


Leu 


Ala Leu 






420 


Leu 


Gly 


Gin Ala 






435 


Ala 


He 


His Cys 




450 




Gly 


Gly 


Ala Thr 


465 






Ala 


Lys 


Ala Gly 


Glu 


Thr 


Thr Thr 






500 


Arg 


Gin 


Gin His 






515 


Thr 


Thr 


Gly Val 




530 




Cys 


Gly 


His He 


545 






Ala 


Ser 


Val Tyr 


Arg 


Tyr 


Phe Arg 






580 


Pro 


Phe 


Gin He 






595 


Phe 


Thr 


He Gly 




610 




Ser 


Asp 


Thr Arg 


625 






Val 


Val 


Thr His 


Glu 


Ala 


Leu Ala 






660 


Thr 


Leu 


Gin Gly 



675 



245 

Asn Arg His Phe 

Lys Lys Cys Ser 
280 

Thr Met Arg Tyr 
295 

Ser Gly Asn Asn 
310 

Leu Trp Met Asp 

325 

Ser Glu Ser Phe 

His Ser Ala Leu 
360 

Gly Asp Ser Ser 
375 

Lys Arg Lys Asn 
390 

Ser Asp Asp Pro 
405 

Thr Phe Arg Leu 

Leu Leu Gin Glu 
440 

Arg Asn Asp Leu 
455 

Ala Val Pro Gin 
470 

He Trp Ala Arg 
485 

Gin Gin Pro Thr 

Asp Thr Asp Lys 
520 

Pro Val Ser Phe 
535 

Ser He Ser Trp 
550 

Gly He Ser Leu 
565 

Tyr Glu Arg Leu 

Phe Lys Ser Gly 
600 

Lys Val His Asn 
615 

Arg Ala Gin His 
630 

Pro Asn Asn Leu 
645 

Ala Ser Tyr Asn 

Ala Phe val Thr 
680 





250 






Ser 


Lys 


Glu 


Asp 


265 








Pro 


Ser 


Leu 


Ala 


His 


Leu 


Thr 


Ser 








300 


Arg 


Val 


Leu 


Pro 






315 




Pro 


Val 


Met 


Pro 




330 






Gin 


Gly 


Leu 


Ser 


345 








His 


He 


Asp 


Ser 


Ala 


Asp 


Ser 


Ser 








380 


Tyr 


Leu 


Leu 


Leu 






395 




Lys 


Arcr 


Leu 


He 




410 






lie 


Trp 


Lvs 


Asp 


425 








Glu 


Arcr 


Gin 


Thr 


His 


Leu 


Glu 


Asn 








460 


Leu 


Asp 


Pro 


Asn 






475 




Asn 


His 


Arcr 


Leu 




490 






Asn 


Ala 


His 


Ser 


505 








Pro 


Gin 


Pro 


Asn 


Leu 


Ala 


Phe 


Leu 








540 


Pro 


Val 


Val 


He 






555 




Ala 


Asp 


Arcr 


Lys 




570 






Arcr 


Thr 


He 


Lvs 


585 








Ser 


Val 


Val 


Lvs 


Thr 


Glu 


Val 


Thr 








620 


Leu 


Gin 


Thr 


Gly 






635 




Glu 


Asn 


Val 


Val 




650 






Lys 


Pro 


Phe 


He 



665 







255 




He 


Tyr 


Ala 


Ala 




270 






He 


Arcr 


Glu 


Met 


285 








Val 


Arcr 


Met 


Ala 


Leu 


Ala 


Pro 


Leu 








320 


Gly 


Met 


Asp 


Gly 






335 




Ala 


Thr 


Phe 


Phe 




350 






Ala 


Pro 


Gly 


Pro 


365 








Pro 


Thr 


Phe 


Leu 


Trp 


Arcr 


Lvs 


Asn 








400 


Glu 


Gly 


Phe 


Pro 






415 




He 


Asn 


Val 


Leu 




430 






He 


Cys 


Glv 


Ala 


445 








Ala 


Asn 




Pro 


Gin 


Asp 




Asn 








480 


Leu 


Cys 


Leu 


He 






495 




Pro 


Gin 


Thr 


Gin 




510 






Pro 


Pro 


Ala 


Lys 


525 








Tvr 


Gin 


Tvr 


Leu 


Leu 


Lvs 


Tvr 


Ala 








560 


Arg 


Gin 


Tyr 


Asp 






575 




Pro 


Asn 


Phe 


Leu 




590 






Leu 


LVS 


Ala 


Glv 


605 








Ala 


Leu 


Lys 


Val 


Cys 


Trp 


Ser 


Ala 








640 


Arg 


His 


Pro 


Pro 






655 




Cys 


Ser 


Leu 


Val 




670 







<210> 327 
<211> 371 
<212> PRT 

<213> Homo sapiens 



216 



WO 01/74836 



PCT/US01/10472 



<400> 327 



Met 


Leu 


Met 


Val Tyr 


Pro Arg Thr 


1 






5 




Trp 


Lys 


Val 


Glu Pro 


Pro Thr Pro 








20 




Ala 


val 


Thr 


Thr Ser 


Ser Ser Ser 






35 




40 


Lys 


Val 


Pro 


Thr Thr 


Lys Ser Thr 




50 






55 


Lys 


Asp 


Gin 


Pro Asp 


Gly Ser Ser 


65 








70 


Gin 


Ser 


Gin 


Pro Pro 


Ala Ala Ser 








85 




Ser 


Lys 


Glu 


Glu Glu 


Ser Ala Met 








100 




Arg 


He 


Pro 


Ser Thr 


Pro Asn His 






115 




120 


Phe 


Glu 


Ala 


Leu Asp 


He Glu Lys 




130 






135 


Val 


Gly 


Pro 


Ser Pro 


Ser Ser Asp 


145 








150 


Arg 


Ala 


Phe 


Pro Ser 


Lys Gin Asp 








165 




Ala 


Pro 


Leu 


Pro Asp 


Ala Ser Ala 








180 




Gin 


Val 


Thr 


Gly Gin 


Glu Gly Gin 






195 




200 


Val 


Leu 


Lys 


Ala Thr 


Gin Glu Ala 




210 






215 


Tyr 


Ser 


Leu 


Pro Gly 


Thr Leu Ala 


225 








230 


Gly 


Thr 


Met 


Asn Phe 


Leu Asp Ser 








245 




Arg 


Asn 


Leu 


Gly Thr 


Ala Cys Asn 








260 




Pro 


Glu 


Gly 


Pro Leu 


Pro Arg Ser 






275 




280 


Arg 


Pro 


Ala 


Trp Ser 


Lys Leu Ala 




290 






295 


Pro 


Arqr 


Gin 


Arg Gly 


Ser Thr Glu 


305 








310 


Asp 


Pro 


Leu 


Ala Ala 


Ala Pro He 








325 




Ala 


Leu 


Ala 


Ser Trp 


Leu Ala Ser 








340 




Arg 


Pro 


Pro 


Thr Pro 


Asp Cys Arg 






355 




360 


Pro 


Thr 


Ser 








370 









<210> 328 
<211> 117 
<212> PRT 

<213> Homo sapiens 



<400> 328 
Met Gly Lys Trp Asp Gly Gly Phe 

1 5 
Glu Asn Thr Cys Ser Leu Asn Arg 
20 

Ala Thr Leu Lys He Ser Trp Gly 



Asn 


Lys 


Gin Asn Gin Lys 


Lys 


Lys 




10 




15 




Gin 


Glu 


Pro Gly Pro Ala 


Lys 


Val 


25 




30 






Ser 


Ser 


Ser He Pro Ser 


Ala 


Glu 






45 , 






Leu 


Trp 


Gin Glu Glu Met 


Arg 


Thr 






60 






Leu 


Ser 


Pro Ala Gin Ser 


Pro 


Ser 






75 




80 


Ser 


Leu 


Arg Glu Pro Gly 


Leu 


Glu 




90 




95 




Ser 


Ser 


Asp Arg Met Asp 


Cvs 


Gly 


105 




110 






Arg 


Arg 


Ser Gin Val lie 


Glu 


Lvs 






125 






Ala 


Glu 


His Met Glu Thr 


Asn 


Ala 






140 






Thr 


Arcr 


Gin Gly Arg Ser 


Glu 


Lvs 






155 




160 


Phe 


Thr 


Asn Glu Ala Pro 


Pro 


Gin 




170 




175 




Ser 


Pro 


Val Ser Thr Pro 


Lvs 


Ser 


185 




190 






Asn 


He 


Ser Trp Asp Met 


Ala 


Val 






205 






Pro 


Ala 


Ala Ser Thr Leu 


Glv 


Ser 






220 






Lvs 


Ser 


Glu He Leu Glu 


Thr 


His 






235 




240 


Ser 


Pro 


Gin Val Arg Tyr 


His 


Pro 




250 




255 




Gin 


Ala 


Gly Leu Asp Arg 


Tyr 


Tyr 


265 




270 






Gly 


Asp 


Asp Thr Val Leu 


Ser 


Pro 






285 






Ala 


Thr 


Arg Ser Gin Arg 


Leu 


Asp 






300 






Arg 


Glu 


Lys Gly Glu Ala 


His 


Asn 






315 




320 


Ser 


Ala 


Ala Gly Ser Arg 


Arg 


Gly 




330 




335 




Pro 


Thr 


Arg Ser Gin Asn 


Pro 


Ala 


345 




350 






Ala 


Pro 


Leu Thr Glu Ser 


Ala 


Arg 






365 






Asn 


Phe 


Met Val Phe Cys 


Val 


Leu 




10 




15 




Ser 


Leu 


Leu Glu Asn Leu 


He 


Phe 


25 




30 






Gly 


Ala 


Gly He Lys Gly 


Trp 


Asn 
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35 40 



His 


Met 


Lys 


Ala 


Gin 


Leu 


Leu 


Thr 


Cys 




50 










55 






Gin 


Leu 


Ser 


Pro 


Gin Gly 


Ser 


Leu 


Glu 


65 










70 








Pro 


Glu 


Leu 


Thr 


Gly Pro 


He 


Glu 


Leu 










85 










His 


Leu 


Ser 


Asn 


Leu 


Leu 


Ser 


Gin 


Thr 








100 










105 


Leu 


His 


Gin 


Asn 


His 














115 
















<210> 329 














<211> 256 














<212> PRT 














<213> Homo 


sapiens 










<400> 329 












Pro 


He 




Ala 




Arg 


Cys 


Tyr 


Gly 


1 








c 
j 










Leu 


Glu 


Leu 


Pro 


i-ixy 


cys 


Asp 


Gly 


Pro 








20 










25 


Ala 


Pro 


Lys 


Met 


Ala 


Ala 


Ser 


Leu 


Leu 






35 










40 




Ala 


Ser 


Trp 


He 


Pro 


Arg 


Asn 


Gly 


Thr 




50 










55 






He 


Ala 


Leu 


Leu 


Glu 


Ala 


Val 


Gly 


His 


65 










70 








Ala 


Thr 


Ser 


Ala 


Ala 


Ser 


Val 


He 


Ser 










85 










Gin 


Phe 


Pro 


Lys 


Leu 


Pro 


Leu 


Gly 


Glu 








100 










105 


Ser 


Tyr 


Trp 


Ser 


Ser 


Asn 


Pro 


Thr 


Lys 






115 










120 




Arg 


Val 


Ala 


Val 


Phe 


Ala 


Arg 


He 


Leu 




130 










135 






His 


Phe 


Ala 


Val 


Phe 


Tyr 


Asp 


Thr 


Phe 


145 










150 








Gly 


Gin 


Arg 


Asp 


Ala 


Cys 


Gly 


Arg 


Leu 










165 










Gin 


Glu 


Gly 


Arg 


Lys 


Gly 


Phe 


Thr 


Ser 








180 










185 


Lys 


Lys 


Arg 


Thr 


Lys 


Arg 


Ser 


Val 


Leu 






195 










200 




Lys 


Lys 


Ser 


Pro 


Glu 


Ser 


Pro 


Gin 


Gly 




210 










215 






Leu 


Arg 


Ser 


Thr 


Val 


Lys 


Phe 


His 


Lys 


225 










230 








Glu 


Val 


Glu 


Lys 


Ser 


Ser 


Val 


Ala 


Arg 



245 



<210> 330 
<211> 96 
<212> PRT 

<213> Homo sapiens 
<220> 
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45 








Val Val 


Phe 


Asp 


Ala 


Gly 


Phe 




60 










Gin Lys 


His 


Leu 


He 


Leu 


Tyr 


75 










80 


Asp Pro 


He 


Cys 


Tyr 


His 


His 


90 








95 




Pro Lys 


Ala 


Ser 


Phe 


Glu 


Val 








110 






Gly Trp 


"hirer 


Gly 


S er 


Leu 


Phe 


10 








15 




Leu Ser 


-»xy 




Gly 


lip 


Til a 
Aid 














Phe Ser 


Asp 


Gin 


Gly 




LSU 






45 








\JL \j _l y 


His 


Ala 


Met 


ser 


vjiy 




60 












Glu 


Leu 




Asn 


Pro 


75 










on 


-rtoll JLXJLJL 


Car 


ser 


ASp 


Ala 


Asn 


90 








95 




He Ser 


Pro 


Asp 


T 

ijys 


Glu 


Jjys 








110 








Pro 


Lys 


Glu 


Thr 


J. Ill 






125 








Asp Val 


Asn 


Asp 


Asn 


Ala 


Pro 




140 










Val Cys 


Glu 


Asn 


Ala 


Aiy 


JcX \J 


155 










160 


Glv Ast> 


xxp 


Asp 


Ser 


Glu 


Lys 


170 








175 




Thr Asn 


Ser 


Val 


He 


Leu 


His 








190 






Gly Gly 


Asp 


Lys 


Phe 




Gin 






205 








Asp Phe 


Gin 


Asp 


Ser 


Arg 


Ser 




220 










Ser Thr 


Arg 


Lys 


Leu 


Gin 


Gly 


235 










240 


Asn Tyr 


Gly 


Phe 


Asp 


Pro 


Leu 


250 








255 
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<221> mis cofeature 
<222> (1) . . . (96) 

<223> Xaa = X or * as defined in Table 6 



<400> 330 



Gly 


Cys 


Leu 


Glu His 


Glu 


Ala 


Ser 


1 






5 








Leu 


Cys 


Ala 


Leu Leu 


Asp 


Met 


Tyr 








20 








Thr 


Leu 


Gin 


Pro Met 


Gly 


Gin 


Pro 






35 








40 


Thr 


Arg 


Glu 


Ser Thr 


Gly 


Phe 


Met 




50 








55 




Ser 


Leu 


He 


Cys His 


Glu 


Thr 


Trp 


65 








70 






Met 


His 


Arg 


Gly Arg 


Tyr 


Trp 


Ala 



85 



Ser Ala Tyr Glu Trp Leu Trp Ser 

10 15 
Thr Ala Gly Pro Thr Lys Thr Gin 

25 30 
Asn Leu Lys Gly Asp Gly Gly Phe 
45 

Gin Leu Pro Ala Asp Phe He Ser 
60 

Val Pro Gly Lys Pro Ser Thr Ala 
75 80 
Glu Pro He Met Leu Pro Lys Xaa 
90 95 



<210> 331 

<211> 142 

<212> PRT 

<213> Homo sapiens 



<400> 331 

Met Ser Glu Lys Asn Thr Pro Leu Val Leu Ser Gly Glu Asn Gin Lys 

15 10 15 

Lys Gly Arg Glu He Gly Val Cys Arg Lys Gin Ser Gin Cys Asp His 

20 25 30 

Gin Asp Asn Asn Ser His Thr Leu Arg Phe Ser Ser Tyr Ser Ser Ser 

35 40 45 

Ser Gly Pro Val Thr Leu Val Ser Phe His Ser His Asn Tyr Pro Ser 

50 55 60 

Lys Val Leu Leu Gin Gly Asn Leu Asp Thr Glu Thr Cys Thr Glu Arg 
65 70 75 80 

Arg Gin Arg Glu He Trp Thr Gin Arg His Glu Gly Lys Cys Gly His 

85 90 95 

Arg Asp Met Tyr Arg Glu Lys Thr Lys Arg Lys Tyr Arg Glu Lys Ala 

100 105 110 

He Tyr Lys Leu Arg Lys Gly Pro Glu Thr Asp Pro Ser Ser Gin Pro 

115 120 125 

Ser Glu Arg Thr Asn Pro Ala Asn Thr Leu He Ser Asp Ser 
130 135 140 



<210> 332 
<211> 424 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (424) 

<223> Xaa =* X or * as defined in Table 6 



<400> 332 

Thr He Ser Trp Arg Gin Gly Arg Gly Gly Glu Ala Gly Arg Arg Leu 
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1 






5 










10 








15 


Trp His 


Thr 


Pro 


Pro 


Gin 


Val 


Lys 


His 


Leu 


Glu Pro 


Gly 


His 


Pro Glu 






20 










25 








30 




Gin Gin 


Ala 


Arg 


Lys 


Ala 


Gin 


He 


Gin 


Gly 


Gin Pro 


Pro 


Ala 


Pro Gly 




35 










40 








45 






Trp Asp 


Leu 


Thr 


Gin 


Gly 


Glu 


Gin 


Ala 


Gly 


Asn He 


His 


Gin 


His Cys 


50 










55 








60 








Gly Gin 


Ser 


Arg 


Gly 


Xaa 


Gin 


Gin 


His 


Gin 


Lys Asp 


Pro 


Xaa 


Gly His 


65 








70 










75 






80 


Arg Phe 


Ala 


Glu 


Gly 


Met 


Gin 


Ser 


Leu 


Ser 


Lys Glu 


Leu 


Gin 


Ser Asp 








85 










90 








95 


Xaa Thr 


Ser 


Arg 


Lys 


Gly 


Gly 


Phe 


Arg 


Val 


Pro Cys 


Ser 


His 


Asn Arg 






100 










105 








110 




Glu Pro 


Pro 


Thr 


Arg 


Pro 


Gly 


Asp 


Pro 


Cys 


Asp Ser 


Pro 


Ala 


Gly Leu 




115 










120 








125 






Gly Leu 


Gin 


Glu 


Cys 


Arg 


Ala 


Arg 


Tyr 


Arg 


Pro Gly 


Lys 


Pro 


Ser Ser 


130 










135 








140 








Pro Pro 


Arg 


Gly 


Gin 


Ser 


Arg 


Ala 


Thr 


Gly 


Pro Val 


Arg 


Trp 


His Pro 


145 








150 










155 






150 


Ser Pro 


Ser 


Arg 


Asn 


Xaa 


Gly 


Pro 


Pro 


Gly 


Ser Arg 


Pro 


Ala 


Pro Gly 








165 










170 








175 


Thr Asp 


Pro 


Ala 


Pro 


Gly 


Arg 


Pro 


Pro 


Gly 


Arg Pro 


Leu 


Ala 


Ala Ser 






180 










185 








190 




Gly Leu 


Leu 


Pro 


Asn 


Ser 


Pro 


Pro 


Ala 


Pro 


Gly Ser 


Pro 


Gin 


Gly Pro 




195 










200 








205 






Pro Pro 


Pro 


Arg 


Gly 


Ser 


Asn 


Arg 


Pro 


Arg 


Phe Pro 


His 


Trp 


Leu Arg 


210 










215 








220 








Arg Pro 


Ala 


Gly 


Arg 


Gly 


Ala 


Pro 


Cys 


Xaa 


Pro Gin 


Pro 


Arq 


Ser Pro 


225 








230 










235 






240 


Gin Gin 


His 


He 


Pro 


Glu 


His 


Arg 


Thr 


Lys 


Pro Val 


Pro 


Ala 


Pro Glu 








245 










250 








255 


Pro Pro 


Ser 


Gly 


Ser 


Arg 


Asn 


Thr 


Asp 


Pro 


Pro Gly 


Gin 


Pro 


Arg Ala 






260 










265 








270 




Arg Gly 


Thr 


Trp 


Lys 


Ala 


Ser 


Pro 


Gly 


His 


Arg Ala 


Asp 


Ser 


Ala Ser 




275 










280 








285 






Arg Arg 


Ala 


Ser 


Phe 


Leu 


Phe 


Arg 


Cys 


Leu 


Ala Asn 


Leu 


Gin 


Arg Ser 


290 










295 








300 








Leu Lys 


Gin 


Met 


Arg 


Gly 


Lys 


Leu 


His 


Ser 


Gin Lys 


Ala 


Gin 


Phe Trp 


305 








310 










315 






320 


Phe lie 


Leu 


Asn 


Gly 


Phe 


He 


Gly 


Gly 


Val 


He Gly 


Arg 


Arg 


Met Thr 








325 










330 








335 


Asp Cys 


Gin 


Ala 


Cys 


Glu 


Pro 


Arg 


Leu 


Arg 


Ser He 


Gin 


Cys 


Gin Leu 






340 










345 








350 




Pro Glu 


Ser 


Tyr 


Thr 


Ser 


Leu 


Cys 


His 


Pro 


Ala Ala 


Leu 


Thr 


Gin Ser 




355 










360 








365 






Gly Pro 


Lys 


Asn 


Val 


Leu 


Glu 


Arg 


Asp 


Gin 


Pro Ser 


Ala 


Cys 


Ser Leu 


370 










375 








380 








Lys Thr 


Pro 


Ala 


Gin 


Thr 


Cys 


Leu 


Pro 


Gin 


Cys Ser 


Leu 


His 


Trp Thr 


385 








390 










395 






400 


Leu Arg 


Asp 


Asp 


Gin 


Thr 


Gin 


Pro 


Leu 


Thr 


Ala Pro 


Ser 


Ser 


Thr Met 








405 










410 








415 


Asn Gly 


Ala 


Tyr 


Arg 


Met 


Lys 


Cys 















420 



<210> 333 

<211> 49 

<212> PRT 

<213> Homo sapiens 



<400> 333 

Pro Leu Val Val Cys Leu Leu Glu Phe Tyr Cys Thr His Leu Arg Asp 
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1 5 
Gly Leu Asn Ser Val Gin Leu Ala 
20 
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Thr Cys Val Arg Met Cys Leu Cys 
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Val Cys Val Cys Gly Gly Cys Lys 
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